THE GENETIC LOCUS UGT1 AND A
MUTATION THEREIN

BACKGROUND OF THE INVENTION

Technical Field

The subject invention relates to the
isolated gene locus UGT1, a family of
UDP-glucuronosyltransferase isozymes encoded by this
locus, and to the uses of both the locus and
the isozymes themselves. In particular, the locus
encodes at least six transferase isoforms, two of
which metabolize bilirubin. The six isoforms share
four common exons. The present inventors have
discovered that in a patient having Crigler-Najjar
syndrome (CN) Type I, a mutation is present in the
second common exon.






Background Information

Bilirubin, an extremely lipophilic
and toxic metabolite formed by the ongoing turnover
of hemoproteins (most significantly hemoglobin), is
normally metabolized to excretable derivatives by
conjugation of one or both of the carboxyl groups of
its propionyl side chains (C-8 or C-12) with
glucuronic acid (Ostrow et al., Biochem. J., 120:311-327
(1970)). The formation of the C-8 or C-12
monoglucuronide, and of the C-8, C-12 diglucuronide, is
catalyzed by hepatic bilirubin
UDP-glucuronosyltransferase (Ritter, et al., J. Biol. Chem.
266:1043-1047 (1991)). Inadequate bilirubin
glucuronidation is commonly observed in neonates
(Roy Chowdhury, et al. in The Metabolic Basis of
Inherited Disease, 1367-1408 (Scriver et al., eds.)
McGraw-Hill, New York (1989)), in breast-fed infants
(Foliot, et al., Pediatr. Res., 10:594-598 (1976)),
and as a consequence of liver diseases (Bloomer, et
al., Am. J. Dig. Dis. 19:9-14 (1974)).

Unconjugated bilirubin in cholestatic
liver diseases is associated with the accumulation
of serum bilirubin and the appearance of the
clinical features of jaundice. Coincident with the
continuous and marked elevations (>20 mg%) in serum
bilirubin concentrations seen in seriously diseased
states is the risk of kernicterus, the degenerative
changes associated with bilirubin deposition in the
brain (Roy Chowdhury, et al., 1367-1408, in
The Metabolic Basis of Inherited Disease, supra
(1989)). Milder elevations in the serum level of
this heme derivative are seen in approximately half
of all neonates, causing clinical jaundice that is
due to delays in the developmental onset of
bilirubin transferase activity, which normally occurs 1
to 5 days after birth.

Although the short-term maintenance of 
jaundiced neonates is a major concern of pediatric 
clinicians, children with inborn errors in bilirubin 
glucuronidation (totally lacking this function) 
present an even greater challenge. 

Heritable disorders of bilirubin
metabolism result in hyperbilirubinemia. These
disorders include those resulting in predominantly
unconjugated hyperbilirubinemia: Gilbert's syndrome
and Crigler-Najjar syndrome, Types I and II.
Deficient activity of bilirubin
glucuronosyltransferase has been described in each of the
three unconjugated hyperbilirubinemias. Bilirubin
glucuronosyltransferase is absent in Crigler-Najjar
syndrome, Type I, and is partially deficient in
Crigler-Najjar syndrome, Type II, and Gilbert's
syndrome.

Approximately 6% of the population suffer
from Gilbert's syndrome. These patients are
typically young adults who present with mild,
predominantly unconjugated hyperbilirubinemia,
having serum bilirubin levels which are
approximately 5-fold elevated over those of
individuals not suffering from the syndrome. These
patients are always borderline jaundiced (jaundice
is a reflection of elevated bilirubin), but are
asymptomatic in the absence of other medical
problems, such as influenza or biliary atresia.
Previously, there has been no way to establish
Gilbert's syndrome without a liver biopsy to measure
bilirubin levels (which may alternatively be
elevated due, for example, to cirrhosis of the liver
or hepatitis). Thus, these patients usually suffer
high morbidity and incur great expense, having seen
many physicians and having undergone many diagnostic
tests. Accordingly, a simple assay specific for bilirubin
UDP-glucuronosyltransferase is needed to efficiently
and reliably diagnose Gilbert's syndrome.
Administration of phenobarbital or other microsomal
enzyme inducers is known to reduce the
hyperbilirubinemia in Gilbert's syndrome.

In contrast to patients suffering from
Gilbert's syndrome, most patients suffering from
Crigler-Najjar syndrome, Type I, die between the
ages of 3 and 20. This syndrome is characterized by
chronic nonhemolytic unconjugated hyperbilirubinemia
in which hepatic bilirubin
UDP-glucuronosyltransferase activity is absent (Crigler et al.,
Pediatrics 10:169-180 (1952)). Crigler-Najjar
syndrome (CN), Type I, is inherited in an autosomal
recessive pattern (Childs et al., Pediatrics, 23:903
(1959); Szabo et al., Acta Paediatr. Hung. 4:153
(1963); and Arias et al., Am. J. Med. 47:395-409
(1969)). The cause of death is
the deposit of unconjugated, lipid-soluble bilirubin
in the gray matter of the central nervous system,
leading to seizures, kernicteric damage, and
ultimately death. The only effective long-term
treatment that has been used successfully in
patients with Crigler-Najjar syndrome, Type I, is
liver transplantation, although this procedure is
not without risk in these individuals. The
fatalities associated with this condition confirm
the need to detoxify bilirubin. Characterization of
the bilirubin transferase gene is, therefore,
critical to making progress in the development of
therapies for the Type I disease.

Crigler-Najjar syndrome, Type II, is
phenotypically similar to Crigler-Najjar syndrome,
Type I, except that it is almost always clinically
benign and the serum bilirubin concentration is
usually below 20 mg/dl. Administration of
phenobarbital or other microsomal enzyme inducers is
known to reduce the hyperbilirubinemia in
Crigler-Najjar syndrome, Type II.

Obviously, in view of the above
discussion, there is a need to understand the locus
encoding the UDP-glucuronosyltransferase isozymes, a
need fulfilled by the present invention.

All U.S. patents and publications referred
to herein are hereby incorporated by reference.

SUMMARY OF THE INVENTION

The present invention relates to an
isolated [illegible] locus. [Within]
this locus are 6 transcriptional units or DNA
segments [illegible]. Each segment encodes a unique
transferase isoform.

The present invention also encompasses
[illegible] exon 1 of each of the 6 DNA segments.

[Illegible] invention relates
to a [illegible] polypeptide having
an amino acid sequence corresponding to human
bilirubin UDP-glucuronosyltransferase type I or to
human bilirubin UDP-glucuronosyltransferase type II.

The present invention also includes the
[illegible]. Additionally, the present invention
[illegible] comprising all [illegible]
procaryotic [illegible] eucaryotic cells transformed or
transfected with these recombinant DNA molecules.

The present invention also includes uses
of [illegible] a diagnostic probe of at least
18 bases for the diagnosis of a syndrome
selected from the group consisting of Gilbert's
syndrome, Crigler-Najjar syndrome Type I, and
Crigler-Najjar syndrome Type II comprising a
specific sequence of a cDNA encoding a mammalian
bilirubin UDP-glucuronosyltransferase enzyme, mutant
form, or variant thereof, said diagnostic probe
identifying the corresponding mammalian genomic DNA.

The invention also encompasses, in
particular, a diagnostic probe for the detection of
Crigler-Najjar, Type I syndrome, wherein said probe
comprises a DNA sequence corresponding to the 13-bp
nucleotide deletion present in exon 2 of the UGT1
complex.

The invention also relates to a diagnostic
probe of at least 18 bases for the diagnosis of a
syndrome selected from the group consisting of
Gilbert's syndrome, Crigler-Najjar syndrome Type I
and Crigler-Najjar syndrome Type II comprising a DNA
sequence having sufficient homology to a cDNA
encoding a mammalian bilirubin
UDP-glucuronosyltransferase form, mutant form, or variant thereof
to identify the mammalian genomic DNA corresponding
to said cDNA.

Furthermore, the invention also includes a
PCR primer pair designed to amplify a specific
portion of mammalian genomic DNA, wherein each
member of said primer pair is from 17 to 20 bases
and further wherein each member of said primer pair
comprises a specific sequence of a cDNA encoding a
mammalian bilirubin UDP-glucuronosyltransferase
form, mutant form, or variant thereof.

Additionally, the invention includes a
diagnostic assay for a syndrome selected from the
group consisting of Gilbert's disease,
Crigler-Najjar syndrome Type I and Crigler-Najjar syndrome
Type II comprising the steps of:

(a) amplifying a sequence of mammalian
genomic DNA using, for example, the PCR primer pair
described above to produce multiple DNA copies of
said mammalian genomic DNA, and

(b) exposing said amplified DNA
according to (a) to one of the diagnostic probes
described above under hybridization conditions to
identify a specific mammalian genomic sequence
characteristic of Gilbert's disease, Crigler-Najjar
syndrome Type I or Crigler-Najjar syndrome Type II.

Alternatively, one could amplify normal
and defective genomic DNA using PCR with one of a
pair of normal (i.e., not altered to match the
defect) primers located over the defect. PCR
product would be expected from template genomic DNA
from a normal individual and from both heterozygous
parents, but not from genomic DNA from the
homozygous defective individual. The failure to
generate product with the homozygous defective
genome is diagnostic.
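
The presence-or-absence logic of this alternative assay can be illustrated with a minimal Python sketch. It is a toy model only: the sequences, the primer, and the function names are hypothetical placeholders (they are not UGT1 sequence or the primers used by the inventors), and annealing is simplified to an exact match of a single primer against one strand.

```python
# Toy model of the presence/absence PCR assay described above.
# All sequences are invented placeholders, not real UGT1 sequence.

FLANK_5 = "GGATCCTTAGCA"           # hypothetical sequence 5' of the defect
DELETED_SEGMENT = "GTCAAGGTCTACC"  # hypothetical 13-base segment lost in the defect
FLANK_3 = "GATTGGCAACT"            # hypothetical sequence 3' of the defect

NORMAL_ALLELE = FLANK_5 + DELETED_SEGMENT + FLANK_3
DEFECTIVE_ALLELE = FLANK_5 + FLANK_3  # the 13 bases are missing

# A "normal" 20-base primer located over the site of the defect.
NORMAL_PRIMER = FLANK_5[-4:] + DELETED_SEGMENT + FLANK_3[:3]


def primer_anneals(primer, allele):
    """Simplifying assumption: the primer anneals only on an exact match."""
    return primer in allele


def product_expected(genotype, primer=NORMAL_PRIMER):
    """A product is expected if the primer anneals to at least one allele."""
    return any(primer_anneals(primer, allele) for allele in genotype)


if __name__ == "__main__":
    genotypes = {
        "normal individual": (NORMAL_ALLELE, NORMAL_ALLELE),
        "heterozygous parent": (NORMAL_ALLELE, DEFECTIVE_ALLELE),
        "homozygous defective": (DEFECTIVE_ALLELE, DEFECTIVE_ALLELE),
    }
    for label, genotype in genotypes.items():
        print(label, "->", "product" if product_expected(genotype) else "no product")
```

In this sketch only the homozygous defective genotype fails to yield product, which mirrors the diagnostic criterion stated in the text.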

Thus, in view of the above, the invention
includes a diagnostic assay for the detection of
Crigler-Najjar, Type I syndrome in a patient
comprising the steps of:

a) altering one of a pair of normal PCR
primers such that said altered primer contains the
deletion mutation present in exon 2 of UGT1F of the
genome of Crigler-Najjar, Type I patients;

b) adding said primers to a DNA sample of
a patient suspected of having said syndrome; and

c) determining whether hybridization
occurs such that a PCR product results, lack of said
product indicating presence of Crigler-Najjar, Type
I syndrome.

It should be noted that the same type of 
assay could potentially be utilized with respect to 
the diagnosis of any type of syndrome or disease 
characterized by a deletion mutation in a gene. 

Additionally, the present invention
includes the amino acid sequences as shown in Figure
9. One sequence corresponds to human bilirubin
UDP-glucuronosyltransferase, Type I (HUG-Br1). The
other sequence corresponds to human bilirubin
UDP-glucuronosyltransferase, Type II (HUG-Br2).



BRIEF DESCRIPTION OF THE DRAWINGS



Figure 1. Nucleotide sequence of the
exons, intron-exon junctions, and 5'- and 3'-
flanking regions of the human UGT1 gene complex.
The exonic sequences (in capitals) were determined
by using a nested sequencing strategy with
double-stranded plasmid DNA as template as described in
Ritter et al., supra (1990) and by comparison to the
HUG-Br1, HUG-Br2, and HLUGP1 cDNA sequences. The
TATA boxes are underlined, and the polyadenylation
consensus signal sequence is in bold. The predicted
amino acid translations (in single letter code) of
the open reading frame contained in each exon 1 of
UGT1A through UGT1F, and UGT1 common exons 2-5, are
shown below the nucleotide data.
Figure 2. A, represents a schematic
diagram of the UGT1 gene locus showing the
exon/intron arrangement. UGT1 is a complex of
nested transcription units featuring multiple first
exons of UGT1A through UGT1F and their associated
promoters (right-angled arrows), and four common
exons (2-5). The solid black lines indicate
intervening DNA. The complex is not drawn to scale
and covers at least an 85-kb region of chromosome 2
(Harding et al., Ann. Hum. Genet. 54:17-21 (1990)).
First exons are spaced an average of 7 kb from each
other and the four common exons are clustered in a
6-kb region at the 3' end of the complex.

B, represents the predicted primary
transcripts produced from the UGT1 gene locus. Wavy
black lines indicate segments of RNA excised from
the transcript during splicing and polyadenylation.
The RNAs encode a family of
UDP-glucuronosyltransferases with unique amino termini (286-289
residues) and identical carboxyl termini (246 amino
acids). The products of UGT1A and UGT1D are the
bilirubin UDP-glucuronosyltransferases encoded by
HUG-Br1 and HUG-Br2, respectively (Ritter et al.,
J. Biol. Chem. 266:1043 (1991)).

Figure 3. Detailed illustration of the
UGT1 gene locus. A, Three overlapping clones
[44 (39.8 kb), 37-1 (43.1 kb), and 42-1 (36 kb)] were
isolated from a leukocyte DNA cosmid library
constructed in Sp cos2 (Heilig et al., Nucleic Acids
Res., 15:9129 (1987)) and transformed into E. coli
490A cells. Approximately 2 x 10[exponent illegible]
colonies were analyzed by the Southern blot technique via
hybridization to the unique region of either HUG-Br1
(1-470-bp fragment) or HUG-Br2 (1-450-bp fragment)
(Ritter et al., J. Biol. Chem., 266:1043 (1991)).
The probes were labelled by random primed synthesis
in the presence of [α-32P]dCTP. Hybridization
conditions used were previously described (Ritter et
al., J. Biol. Chem., 266:1043 (1991)). Clones 44
and 37-1 hybridized to the HUG-Br2-specific probe and
not to the HUG-Br1-specific or the common-end probe. Clone
42-1 hybridized to the HUG-Br1-specific and the 3' common-end
[1836 to 2167 bp of the cDNAs] probes (Ritter et
al., J. Biol. Chem. 266:1043 (1991)). The sites
for 6 different restriction endonucleases and their
relative positions among the three clones were
determined as shown in Figure 3B. Fourteen
contiguous subclones in pBluescript SK+ were made
from the three overlapping cosmid clones. B,
Positive subclones which hybridized to the HUG-Br1,
HUG-Br2, or common-end probes were mapped in
greater detail and sequenced as previously described
(Ritter et al., supra (1991)). (B, C) Four common
exons (solid boxes) encoding the common region of
the cDNAs and six exon 1s of UGT1A to UGT1F
(patterned boxes specifying uniqueness) were
identified and are shown according to their
distribution in the cosmid clones. Introns are
shown as lines. Endonucleases used were: PstI (P),
EcoRI (E), XhoI (Xh), XbaI (X), ClaI (C), BamHI (B),
HindIII (H), and SalI (S). Exon 1 of UGT1BP or
UGT1C contains a KpnI or an SphI site, respectively,
which was used to establish orientation. E, Cap
sites for the UGT1A, UGT1D, and UGT1F mRNAs
(outlined C or G) were determined by primer
extension. The cap sites (bold C) were predicted
for the UGT1BP, UGT1C, and UGT1E mRNAs based on 90%
sequence identity to exon 1 of UGT1D in the coding
region and in at least 200 bp of the flanking regions.
Each of the six unique exon 1s contained a 5' TATA
box [illegible]. C, According to the maps for endonucleases
and sequence data, the UGT1 gene locus [illegible] kb. D, The
nested arrangement of the 6 TATA boxes immediately
upstream of a predicted cap site of the mRNA and the
location of a donor splice motif at the 3'
exon/intron junction of these exons (data not shown)
indicate that six transcriptional initiation events
by RNA polymerase II take place in a nested fashion.
Differential splicing of the exon 1 in each primary
transcript to the common exons is predicted to
produce 6 different mRNAs.

Figure 4. A, Autoradiograms of Sanger
nucleotide sequencing reactions of normal and CN,
Type I patient DNA. Plasmid DNA containing the
entire UGT1 exon 2 from a normal individual (A) and the CN,
Type I patient FB (B) was sequenced using the
Bluescript primer, KS. The sequence ladders were
generated by electrophoresis through a denaturing 6%
polyacrylamide gel, after which the gels were
transferred to blotting paper, dried, and exposed
overnight to X-ray film to generate the
autoradiogram shown. A, C, T and G are lanes
corresponding to separate reactions with added
dideoxy derivatives of ATP, CTP, TTP and GTP,
respectively. The boundary between the first intron
and the second exon is shown. The shaded vertical
line designates the 13-bp segment in normal DNA
which is deleted from the DNA of the CN, Type I
patient.
B, Predicted effect of the 13-bp deletion
on the bilirubin UDP-glucuronosyltransferases
encoded by UGT1A and UGT1D. In a normal individual,
the bilirubin transferases encoded by UGT1A
(HUG-Br1) and UGT1D (HUG-Br2) have different
N-termini (288 or 289 amino acids, respectively) and
identical C-termini (246 amino acids). In CN, Type
I patient FB, these proteins are predicted to be
severely truncated. N and C, the amino and carboxyl
termini of bilirubin transferase; CS (unknown
function) and MAD (membrane anchoring domain) are
conserved domains present in all known transferases.
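
Why a 13-bp deletion is predicted to truncate the protein can be illustrated with a short Python sketch. Because 13 is not a multiple of 3, the deletion shifts the reading frame, and the shifted frame may soon reach a stop codon. The open reading frame below is an invented toy sequence, not UGT1 exon 2, and the position of the deletion is chosen only for illustration.

```python
from itertools import product

# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate from the first base until a stop codon or the end of the sequence."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Hypothetical 48-base open reading frame (toy data only).
NORMAL_ORF = "ATGGCTGAACTGAAAGACCTTGCAGGTTCCAAGCTGAACTACAAATGA"

# Remove 13 bases starting after the second codon (hypothetical position).
DELETION_START, DELETION_LENGTH = 6, 13
MUTANT_ORF = NORMAL_ORF[:DELETION_START] + NORMAL_ORF[DELETION_START + DELETION_LENGTH:]

if __name__ == "__main__":
    print("frame shifted:", DELETION_LENGTH % 3 != 0)
    print("normal :", translate(NORMAL_ORF))
    print("mutant :", translate(MUTANT_ORF))
```

In this toy example the normal frame encodes 15 residues, whereas the shifted frame encodes 7 residues before a premature stop codon, which is the kind of truncation the figure predicts for the real transferases.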

Figure 5. PCR analysis of genomic DNA
from family members. Samples of PCR reactions
generated from genomic DNA (0.1 µg) of the CN, Type
I individual, FB (Lane 3), his parents (Lanes 2 and
4), and an unrelated normal individual (Lane 1) with
oligonucleotides PXG3 and J127 as primers were
analyzed by electrophoresis through nondenaturing
polyacrylamide (6%) for 5 hr. at 50V in 0.5X TBE.
The gel was stained with ethidium bromide,
irradiated with UV, and photographed. The primers
amplify a fragment representing the last 45 bp of
the first intron and the first 154 bp of exon 2.
Lane 5 is a sample of a control reaction in which
DNAs from FB (homozygous affected) and the unrelated
normal (homozygous normal) were mixed in
approximately equal proportions and subjected to the
same PCR analysis. STD is a molecular size
standard, 1 Kilobase Ladder (Bethesda Research
Laboratories). Molecular sizes (in base pairs) are
shown to the right of the figure.
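
The fragment sizes expected in this analysis follow from simple arithmetic on the figures given above, sketched here in Python. The sketch assumes that the 13-bp deletion lies entirely within the amplified portion of exon 2; the genotype encoding and function name are illustrative choices, not part of the described method.

```python
INTRON_1_BP = 45    # last 45 bp of the first intron (from the text)
EXON_2_BP = 154     # first 154 bp of exon 2 (from the text)
DELETION_BP = 13    # size of the deletion in the CN, Type I patient

NORMAL_PRODUCT_BP = INTRON_1_BP + EXON_2_BP
DELETED_PRODUCT_BP = NORMAL_PRODUCT_BP - DELETION_BP


def expected_band_sizes(genotype):
    """Return the sorted product sizes for a pair of alleles.

    Each allele is given as True if it carries the deletion, False if normal.
    """
    sizes = {DELETED_PRODUCT_BP if has_deletion else NORMAL_PRODUCT_BP
             for has_deletion in genotype}
    return sorted(sizes)


if __name__ == "__main__":
    print("homozygous normal  :", expected_band_sizes((False, False)))
    print("heterozygous parent:", expected_band_sizes((False, True)))
    print("homozygous affected:", expected_band_sizes((True, True)))
```

Under these assumptions a normal individual gives a single 199-bp product, the affected individual a single 186-bp product, and a heterozygous parent (or the mixed control of Lane 5) both.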
Figure 6. A, Northern blot analysis of
human liver mRNA and kidney and skin total RNA
samples using UGT1A-, UGT1D-, and UGT1F-specific
probes. Triplicate sets of two different liver mRNA
(2 µg each) samples, and one kidney and one skin
total RNA (25 ng each) samples were prepared for
analysis according to published procedures (Ritter
et al., J. Biol. Chem., 265:7900 (1990)).
Corrections for differences in probe lengths were
made in calculations of relative amounts of mRNA. A
human β-actin probe was hybridized (Church et al.,
Proc. Natl. Acad. Sci. USA 81:1991 (1984)) to each
filter to normalize the amount of RNA applied to the
gel.

B, Southern blot analysis of
endonuclease-digested human genomic DNA using three
different exon 1-specific probes and the common exon 5-specific
probe. Human lymphocyte DNA was digested as
indicated, electrophoresed in a 0.8% agarose gel,
and blotted onto ZetaBind membrane to make a
duplicate set. Probes are defined at the top of the
panel. One filter was first exposed to a UGT1F-specific
probe, stripped, and rehybridized to a UGT1A-specific
one. The second filter was exposed to the
exon-5 probe and then to a UGT1D one in a similar
manner. The hybridization conditions were as
described (Ritter et al., supra (1990)).

Figure 7. The conjugation of glucuronic
acid (donated by UDP-glucuronic acid) to bilirubin-IXα
to produce water-soluble IXα C12
bilirubin-β-glucuronide.

Figure 8. Three different
bilirubin-β-glucuronide conjugates: bilirubin-IXα C-8
monoconjugate, bilirubin-IXα C-12 monoconjugate, and
bilirubin-IXα diconjugate.

Figure 9. The nucleotide and deduced
amino acid sequences of HUG-Br1 and HUG-Br2.
Nucleotide and deduced amino acid residues in
HUG-Br1 which differ from HUG-Br2 are shown in reverse
font. The start and stop codons are designated by
open boxes. Putative membrane-insertion signal and
membrane-anchoring peptides are indicated by dashed
and solid lines, respectively. Predicted
asparagine-linked glycosylation sites are denoted by
solid triangles and consensus polyadenylation
signals by dashed boxes.

Figure 10 (Panel A). Hybridization of
HUG-Br1 and HUG-Br2 to liver mRNA isolated from
normal humans and from a Crigler-Najjar syndrome
patient. Duplicate sets of mRNA samples (2 µg each)
from two normal (N) and a Type I Crigler-Najjar
patient (CN) were electrophoresed as described
(Ritter, et al., J. Biol. Chem., 265:7900-7906
(1990)) and transferred onto ZetaBind membrane.
One filter was hybridized to a HUG-Br1-specific
probe and one to a HUG-Br2-specific probe. The
amount of mRNA applied to the gel was normalized by
hybridization to a 660-bp human cyclophilin probe
(bottom of panel A). All mRNA levels were
quantitated by densitometric scans of appropriately
exposed X-ray film, and the areas under the curves
were then evaluated by computer analysis.
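
The normalization described for this figure can be sketched in a few lines of Python. This is only an assumed form of the calculation (target signal divided by the cyclophilin control signal from the same lane); the source does not give the formula, and the densitometric areas in the example are invented numbers.

```python
def normalized_level(target_area, control_area):
    """Divide a band's densitometric area by the control (cyclophilin) area
    from the same lane, so that lanes loaded with different amounts of mRNA
    can be compared. Assumed form of the calculation, not taken from the source."""
    if control_area <= 0:
        raise ValueError("control signal must be positive")
    return target_area / control_area


def fold_difference(sample, reference):
    """Ratio of two normalized levels; each argument is (target_area, control_area)."""
    return normalized_level(*sample) / normalized_level(*reference)


if __name__ == "__main__":
    # Hypothetical areas under the densitometer curves (arbitrary units).
    lane_a = (300.0, 150.0)
    lane_b = (50.0, 100.0)
    print(fold_difference(lane_a, lane_b))
```

The same ratio can be used to compare two probes on duplicate filters, provided each is divided by its own control signal.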
Figure 10 (Panel B). Hybridization of
HUG-Br1 and HUG-Br2 to liver mRNA isolated from the
Erythrocebus patas monkey. Duplicate sets of mRNA
(2 µg) isolated from either an untreated monkey or one
maintained on 15 mg% phenobarbital (PB) in the
drinking water were analyzed as in Figure 6A.

Figure 11. Expression of Bilirubin
Transferase Activity in Mouse Microsomes and COS-1
Cells Transfected with either pHUG-Br1 or pHUG-Br2.
Bilirubin glucuronidation was measured using
14C-bilirubin and UDP-glucuronic acid, and the
glucuronides were transesterified to their
corresponding methyl esters and subjected to TLC.
The TLC plates were exposed to X-ray film (10 days).
Panels A and B represent two different experiments.
Bilirubin glucuronides formed by mouse microsomes
(2.9 mg) in the presence (A, Lane 3) and absence (A,
Lane 4) of UDP-glucuronic acid are shown.
Sensitivity of the reaction product generated in 6.5
hr. to glucuronidase treatment for 1 hr. at 37°C is
shown in A, Lane 2. The methyl esters of bilirubin
corresponding to the IXαC8 and the IXαC12 isomeric
monoglucuronides (BMEs), the diglucuronide (BDE),
and unconjugated bilirubin (Br) are designated. In
this system, the rank in mobility is: Br > IXαC8 >
IXαC12 > IXαC8,IXαC12. Cells transfected with
pHUG-Br2 or pHUG-Br1, which contained the insert in the
correct orientation with respect to the promoter
element in pSVL, are shown in A (Lane 1) and B (Lane
2), respectively. Cell homogenates were assayed as
described for microsomes. Control homogenate from
cells transfected with pHUG-Br1 in the reversed
orientation is shown in B (Lane 3). The sensitivity
of the product generated by HUG-Br1-encoded activity
to β-glucuronidase is shown in B, Lane 1.
Unconjugated bilirubin is more sensitive to oxygen
than its glucuronide or methyl ester derivatives,
and the oxidized bilirubin will remain at the origin
in this system. A nonspecific band (NS) is present
in each reaction.

Figure 12. A comparison of the amino acid
sequences of various UDP-glucuronosyltransferase
forms: rat phenol UDP-glucuronosyltransferase (Rat
PHENOL), human phenol UDP-glucuronosyltransferase
(HLUGP1), human bilirubin
UDP-glucuronosyltransferase, type I (HUG-BR1), human bilirubin
UDP-glucuronosyltransferase, type II (HUG-BR2), and rat bilirubin
UDP-glucuronosyltransferase (Rat Bil). As shown in the
figure, the carboxyl terminus of the various
UDP-glucuronosyltransferase forms is conserved as
between the human and rat species and the phenol and
bilirubin acceptor substrate forms. The human
forms, HLUGP1, HUG-Br1 and HUG-Br2, show exact
homology between amino acids 283-531, 285-533, and
286-534, respectively. The rat forms, Rat PHENOL
and Rat Bil, show exact homology between amino acids
281-529 and 283-531, respectively. Further, there
is a 90.6% conserved amino acid sequence between the
two species. In the nonconserved region of Rat
PHENOL, HLUGP1, HUG-BR1, HUG-BR2 and Rat Bil,
corresponding to amino acids 1-280, 1-282, 1-284,
1-285, and 1-282, respectively, there is 66% homology
between HUG-BR1 and HUG-BR2, 70% homology between
HUG-BR1 and Rat Bil, 76% homology between HUG-BR2
and Rat Bil, 84% homology between Rat PHENOL and
HLUGP1, and 60% homology between Rat Bil and HLUGP1.

Figures 13A and B. Restriction maps of
cosmid clones containing the genes coding for
HUG-Br1 and HUG-Br2. Note Figure 13A is continuous with
Figure 13B, overlapping at the arrows shown on each.
Sequence data showed that the cDNAs
contained identical 3' ends (1469 bp in length) to
each other and to that of the human phenol
transferase cDNA, HLUGP1 (Harding et al., Proc. Natl.
Acad. Sci. USA 85:8381 (1988)). (The phenol isozyme
specifically conjugates a series of
hydroxyl-substituted aromatic compounds.) In contrast,
HLUGP1, HUG-Br1 and HUG-Br2 have unique 5' ends
(879, 882 and 912 bp) which encode the first 286 of
534, 288 of 533, and 289 of 534 amino acids,
respectively. The identity in the 3' region of the
3 clones suggested that a gene arrangement exists
such that primary transcripts containing unique
exons are alternatively spliced to one or more
common exons to account for the structural
similarities. Sato et al. (Biochem. Biophys. Res.
Commun. 169:260 (1990)) have shown that a rat
bilirubin transferase cDNA contains a 3' end
identical to that of the rat phenol transferase cDNA
(Iyanagi et al., J. Biol. Chem. 261:15607 (1986)).
The carboxyl termini of the rat and human isoforms
are 90% identical.

Furthermore, prior to these findings, it
was not known that more than one bilirubin
metabolizing isoform exists and that one of these is
responsive to phenobarbital treatment.
Phenobarbital has been the agent for treating and/or
diagnosing patients with certain defects in
bilirubin glucuronidating activity. The cDNAs
provided the necessary probes for isolating,
sequencing and describing the genes encoding the
isoforms. The purification and nucleotide
sequencing of the genes encoding the bilirubin and
phenol transferase isoforms and all flanking regions
provide all the necessary information to describe
and verify all critical and non-critical mutations
which may lead to an impairment or total loss of
bilirubin glucuronidating activity in affected
individuals. The gene complex arrangement with an
independent promoter for each encoded isoform and
the sharing of 4 common exons provide a basis for
the human phenotypes seen with respect to impaired
bilirubin and phenol transferase activities. The
sequence data of the normal gene allow one to
amplify and sequence all relevant regions of a
defective gene to determine alterations and to
confirm the effect of the mutation on activity by
altering the cDNA before expression in an
appropriate host cell. Prior to the present
invention, there was no available gene-based method
for approaching or developing an analysis of
abnormal detoxification of bilirubin in humans.

Primary messenger RNA transcripts are
predicted by each unit as shown in Figure 2 (panel
B) and are further predicted to contain different 5'
termini but identical 3' termini. The model
indicates that each transcript undergoes
differential splicing of the first exon to exon 2
(common exon) and then is further extended with
exons 3, 4, and 5 (common exons) to produce mature
RNAs approximately 2.6 kb in length.

Two of the 6 transcriptional units, UGT1A
and UGT1D, respectively, represent the
transcriptional units which encode mRNAs
corresponding to HUG-Br1 and HUG-Br2 bilirubin
transferase cDNAs prepared from mRNA isolated from
DETAILED DESCRIPTION OF THE INVENTION

The subject invention relates to the UGT1
locus. The nucleotide sequence of UGT1 is shown in
Figure 1. UGT1 is a unique gene complex featuring
six nested transferase transcriptional units, which
are depicted in Figure 2 (panel A). Each of these
units, UGT1A-UGT1F, encodes a unique transferase
isoform. The complex contains a tandem array of six
promoters (Fig. 2A, arrows) with each positioned
adjacent to a first exon (Fig. 2A, designated boxes,
≈880-960 bp each), thereby defining the
transcriptional start site for the unit.

The six unique amino termini of 286-289
amino acids are encoded by the six different
first exons and identical carboxyl termini of 246
amino acids are encoded by the common exons 2-5. It
is thought that critical mutations introduced into
any of the common exons 2-5 will result in the
inactivation of all encoded isoforms.
Alternatively, a critical mutation in exon 1 or a
specific regulatory region can generate a specific
defect in that encoded isoform or responsiveness to
drug treatment, respectively. Mutations in
sequences necessary for RNA splicing (e.g., upstream
of a common exon) can generate a defect in all the
transferase isoforms. The evidence from the gene
structure and the presence of corresponding mRNAs
indicates that typical liver microsomal bilirubin
UDP-glucuronosyltransferase is necessarily composed of
at least two distinct isozymes encoded by the cDNAs
HUG-Br1 and HUG-Br2 (Ritter et al., J. Biol. Chem.
266:1043-47 (1991)) which are encoded by the UGT1
locus.

The discovery of the two bilirubin
transferase cDNAs of the present invention, HUG-Br1
and HUG-Br2, as described below, provided and
established for the first time the nucleotide
sequence data which code for any human bilirubin
transferase enzyme.

In particular, the present invention
relates to the isolation and characterization of
these two human liver bilirubin
UDP-glucuronosyltransferase cDNAs, referred to as HUG-Br1 and
HUG-Br2 (Ritter, et al., J. Biol. Chem.
266:1043-1047 (1991)) which, upon expression
individually in COS-1 cells, encode isoforms that
catalyze the formation of the two bilirubin
monoglucuronides and the diglucuronide. Northern
analysis of HUG-Br1 and HUG-Br2 mRNA showed that
HUG-Br1 is 4-fold more abundant than HUG-Br2, and
that, in the Erythrocebus patas monkey, mRNA
hybridizing with HUG-Br2, but not HUG-Br1, is
inducible by phenobarbital treatment. It was also
discovered that mRNAs corresponding to both
bilirubin transferase isoforms were present in the
explanted liver of a CN, Type I patient (Ritter et
al., J. Biol. Chem. 266:1043-1047 (1991)). The
observations that the mRNAs were of the normal size
(≈2.6 kb) and abundance suggest that neither
defective regulation nor gross abnormalities in gene
structure is causative and that smaller point
mutations are probably involved.
normal human liver. A third unit, UGT1F, encodes a
phenol-specific isoform (Dutton, Glucuronidation of
Drugs and Other Compounds, CRC Press, Boca Raton,
FL, pp. 3-78 (1980)).

Exon 1 of UGT1A, UGT1D, and UGT1F is
located 5.6, 49, and 73 kb, respectively, upstream
of the 4 common exons (see Fig. 3B and 3C). Three
additional unique exon 1s of UGT1BP, UGT1C, and
UGT1E, which are ≈90% identical to and of the same
length as that of the HUG-Br2 exon (UGT1D), but
never before uncovered, were identified 19, 38,
and 54 kb upstream of the 4 common exons (Fig. 3B
and 3C). The first exons of UGT1C and UGT1E are
predicted to encode the unique regions of proteins
which are similar to the HUG-Br2 protein. The first
exon of UGT1BP has a frame-shift mutation which
creates a premature stop codon and, presumably, a
truncated protein. It is, therefore, designated a
pseudogene (P). Because of the 90% identity of these
exon 1s with that of UGT1D, it is possible that
UGT1C and UGT1E encode bilirubin transferase
isoforms with properties similar to that encoded by
HUG-Br2.

Each of the 6 unique exon 1s of UGT1A to
UGT1F has an RNA polymerase II transcriptional
promoter element, TATATATATATATAA, TAATTAA, or
TATCAAA, upstream of the predicted cap site of the
mRNA as detailed in Figs. 3C and 3E. The presence of
these elements at position -37 to -23 bp (UGT1A) or
-31 to -25 bp (all other exon 1s) from the cap site
is consistent with the typical location of TATA
boxes. The presence of individual promoters creates
the condition whereby each encoded isoform can have
independent regulation and tissue-specific
expression. Hence, an independent mutation in a
promoter region can specifically affect the
expression of that isoform.
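
The promoter coordinates quoted above can be checked against the element lengths with a short Python sketch. The pairing of the 15-nucleotide element with UGT1A is an assumption made here because only that element fits a 15-bp span; the text itself refers to Figs. 3C and 3E for the assignment.

```python
# Promoter elements named in the text, paired with the position ranges
# (relative to the cap site) given for them. The pairing of the long
# element with UGT1A is an assumption based on its length.
ELEMENTS = {
    "TATATATATATATAA": (-37, -23),  # assumed to be the UGT1A element
    "TAATTAA": (-31, -25),
    "TATCAAA": (-31, -25),
}


def span_length(start, end):
    """Number of base positions covered by an inclusive range."""
    return end - start + 1


def spans_match(elements):
    """Return True for each element whose length equals its quoted span."""
    return {seq: len(seq) == span_length(*rng) for seq, rng in elements.items()}


if __name__ == "__main__":
    for seq, ok in spans_match(ELEMENTS).items():
        print(seq, len(seq), "bp, span consistent:", ok)
```

Under this assumed pairing, each element exactly fills its quoted interval, which is what one would expect if the positions were read directly from the sequence.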

Each exon 1 has a 3' exon/intron junction
consistent with a donor splice site (data not
shown). The common exons 2, 3, and 4 each contain an
acceptor and a donor splice site; common exon 5
contains only an acceptor site. The model predicts
the synthesis of at least 6 nested primary
transcripts ranging from 15 to 95 kb (Fig. 3D),
where each exon 1 is differentially spliced to the 4
common exons to generate 6 different mRNAs.

The exon 1s uncovered in this invention
allow one to explain the substrate specificity of
the isoforms and to explain defects in humans where
only bilirubin transferase activity is impaired and
not phenol transferase activity. Furthermore, the
discovery of other bilirubin transferase-like
encoded isoforms (UGT1C and UGT1E) potentially
provides other forms that are expressible with the
appropriate drug treatment. Hence, UGT1A (HUG-Br1)
and UGT1D (HUG-Br2) are now known isoforms which
detoxify bilirubin. Moreover, UGT1C and UGT1E are
possibly (based on 90% identity to UGT1D) bilirubin
transferase isoforms. The identification of UGT1A,
UGT1C, UGT1D, and UGT1E allows one to determine if
nucleotide alterations exist in each of the
corresponding exon 1s as well as in the 4 common
exons in humans with defective bilirubin transferase
activity. Furthermore, upon establishing all the
types of defects which exist in human subjects,
oligonucleotides can be designed and applied to the
diagnostic screening of this population of
individuals.

Preliminary data show that DNA from the
explanted liver from a CN, Type I patient has a
codon (3 nucleotides) deletion in UGT1A without any
alterations in the common exons. This individual
conjugated 12% of the bilirubin in the bile,
presumably due to a normal activity encoded by the
UGT1D mRNA, which is expressed at a low level. The
RNA studies on samples from this individual have
been reported (Ritter et al., J. Biol. Chem.
266:1043-47 (1991)). This individual was reported
to have normal phenol transferase activity.

Comparisons of the UGT1 gene complex with
the only characterized transferase gene (Mackenzie
et al., J. Biol. Chem. 265:11328 (1990)) (rat,
testosterone) show that the size and distribution of
exons in that simple gene are very similar to those
in this complex. For the rat gene, exons 1 and 2
code for the 293-residue NH2-terminal region, and
exons 3 to 6 code for 44, 29, 74, and 91 amino
acids, respectively. For the UGT1 locus, exons 2 to
5 code for 44, 29, 74, and 99 amino acids,
respectively. The similarities suggest that other
domains, which are not yet understood, are
represented in the 3' exons.

Complementary DNA cloning studies
established that the mRNAs encoding UGT1A, UGT1D,
and UGT1F are expressed in liver, and induction
studies indicate differential responsiveness of
promoter elements (Ritter et al., J. Biol. Chem.
266:1043 (1991) and Harding et al., Proc. Natl.
Acad. Sci. U.S.A. 85:8381 (1988)). Northern blot
analysis (Fig. 6A) establishes that the relative
level of mRNA in liver encoding UGT1A, UGT1D, and
UGT1F is 5:2:1. (The present inventors have also
demonstrated that the mRNA coding for UGT1D, but not
UGT1A, is induced in monkey by phenobarbital
treatment (Ritter et al., J. Biol. Chem. 266:1043
(1991)).) Preliminary sequence analysis of UGT1D-
like cDNAs generated by an RT-PCR approach showed
that UGT1D- and UGT1C-specific poly(A+) RNA account
for 88% (44/50) and 12% (6/50), respectively, from a
single individual (Harding et al., supra (1988)).
Figure 6 shows that UGT1F mRNA is expressed in
kidney and skin at levels equal to or higher than
that in liver, whereas the mRNAs coding for UGT1A
and UGT1D are not detected in kidney and skin.
Liver is the only known site for bilirubin
transferase activity (Chowdhury et al., The
Metabolic Basis of Inherited Disease (Scriver et
al., eds.) McGraw-Hill, N.Y., pp. 1367-1408 (1989)).
Thus, the multiple initiation sites and splicing to
produce alternative protein products have major
consequences for cellular phenotype. The liver
selectively processes bilirubin for clearance
through the biliary system, while phenol clearance
is not restricted and takes place in many different
tissues. The presumed similarities in the chemical
nature of bilirubin and phenol and/or the biological
circumstances leading to the establishment and
evolution of a single gene complex to encode
detoxifying enzymes are not obvious.

The implication of this gene arrangement
is that multiple bilirubin transferase isozymes have
evolved to cope with and prevent the toxic effects
of the ever-present waste product derived from
hemoprotein turnover. It is known that bilirubin
represents the only biological toxin in mammals
undergoing constant replenishment: a) from dying red
blood cells and b) from episodic increases due to
red blood cell hemolysis. Thus, the toxin most
likely represents a powerful selective pressure to
evolve an efficient detoxifying system. Duplication
events within the UGT1D-like alleles have likely
been in response to the need to expand the only
mechanism for detoxifying this heme derivative. The
syndromes in humans attributed to mild elevations in
serum bilirubin are in response to a modest decrease
in bilirubin transferase activity. This reciprocal
relationship likely signals that the constitutive
level of the enzyme is not in excess for
metabolizing this endogenous toxin. Furthermore,
the inducibility of the mRNA encoding the UGT1D
gene by phenobarbital expands the glucuronidating
capacity and provides greater flexibility to the
organism in responding to environmental factors.
The gene arrangement is flawed by the obvious
prediction that a single critical mutation in a
common exon can inactivate the entire subfamily of
isozymes. Such a critical mutation is found in the
genome of the first patient with Crigler-Najjar,
Type I disease to be described.

As supported by Example II, the present
inventors have discovered a homozygous 13-bp
deletion in common exon 2 of the gene complex, UGT1,
in a patient with CN, Type I, a deletion which is
related to the claimed invention. The nature of
this defect is consistent with inactivation of the
gene product. Figure 4B shows the proposed effect
of the frameshift mutation, which results in the
synthesis of truncated forms of both bilirubin
transferase isozymes. Figure 4A shows the site
leading to a change in its predicted translation.
The FB sequence would, therefore, be very different
from that of a normal individual after codon 294.
Continuation of translation of the open reading
frame (not shown in Fig. 4A) shows a TGA stop signal
after the 365th codon (illustrated schematically in
Fig. 4B); thus, a 72-residue unrelated peptide is
present after codon 294.
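
Why a 13-bp deletion changes everything downstream can be illustrated with a minimal Python sketch. The toy sequence below is invented for illustration and is not the UGT1 exon 2 sequence; only the deletion length of 13 bp comes from the text, and the standard genetic code is assumed.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(seq):
    """Translate complete codons from position 0 until the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def deletion_effect(n_bp):
    """Split a deletion into whole codons removed and leftover bases."""
    return {
        "whole_codons_removed": n_bp // 3,
        "leftover_bases": n_bp % 3,
        "frameshift": n_bp % 3 != 0,
    }


# Hypothetical 36-bp open reading frame (11 codons plus a stop codon).
TOY_NORMAL = "ATGGCTGAAACCGGTCTGAAAGATCCGTTTGCTTAA"
# Remove 13 bp starting after the second codon (index 6).
TOY_MUTANT = TOY_NORMAL[:6] + TOY_NORMAL[6 + 13:]

if __name__ == "__main__":
    print(deletion_effect(13))
    print("normal:", translate(TOY_NORMAL))
    print("mutant:", translate(TOY_MUTANT))
```

Because 13 is not a multiple of three, four whole codons are lost and one extra base is removed, so every codon after the deletion is read in a different frame. In the toy example the original stop codon is no longer in frame, which mirrors the way translation in FB continues into unrelated sequence until a new stop signal is met.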

As deduced by translation of the altered
coding region, the bilirubin transferases
represented by UGT1A and UGT1D are predicted to be
missing the final 240 amino acids of the wild type
proteins, a region in the phenol/bilirubin
transferase gene which is highly conserved across
species (90% identity between rat and human) (Ritter
et al., J. Biol. Chem. 266:1043-1047 (1991)). This
region contains two smaller domains/motifs that have
been recognized as highly conserved among
transferase cDNAs encoding members of both major
subgroups so far identified, the steroid and
bilirubin/phenol subfamilies (Tephly et al., TIPS
11:276-279 (1990)). The membrane-spanning domain at
the extreme carboxyl terminus of all known
transferases is missing. The absence of these 17
consecutive hydrophobic residues, normally flanked
on the amino terminal side by a charged residue
(Asp) and on the carboxyl terminal side by a highly
positively charged halt transfer signal (Lys/Arg
rich), should cause the UGT1-encoded proteins to
distribute in the cytosol. In addition, the highly
conserved 18-residue motif including the first 6
residues of the membrane-spanning domain (present in
all transferase isozymes and utilized as a strategy
for the cloning of the human (Ritter et al., J.
Biol. Chem. 266:1043-1047 (1991)) and rat (Sato et
al., Biochem. Biophys. Res. Comm. 169:260-264
(1990)) bilirubin isoforms) is also missing in FB.
Its function is currently unknown, although it has
been speculated that it participates in the binding
of the common donor substrate, UDP-glucuronic acid.

In addition to the predicted loss of four
codons (294-297), as noted above, the truncated
proteins are predicted to contain a foreign 72-
residue carboxyl terminus, resulting from the -1
shift in reading frame. A 1-bp deletion with a
similar consequence (i.e., the introduction of a
premature stop codon) was described at the cDNA
level for phenol and bilirubin transferase mRNAs of
the hyperbilirubinemic Gunn rat, considered a model
of CN, Type I disease (Schmid et al., J. Clin.
Invest. 37:1123-113 (1958)). Iyanagi et al. (J.
Biol. Chem. 264:21302-21307 (1989)) showed that
defective 3-methylcholanthrene induction of phenol
glucuronidating activity was due to a -1 frameshift
mutation at codon 414 in the mRNA for the phenol
form (Iyanagi et al., J. Biol. Chem. 261:15607-15614
(1986)) (which is the orthologue of the form encoded
by the human UGT1F). Presumably, the gene
arrangements are similar between human and rat.
This frameshift would, therefore, occur in a common
exon of the corresponding locus in the rat. This
mutation results in the synthesis of a truncated
enzyme (Elwady et al., J. Biol. Chem. 265:10752-10758
(1990)). More recently, Sato et al. showed that the
genetic defect in bilirubin UDP-
glucuronosyltransferase in the Gunn rat was due to
the same mutation (Sato et al., Biochem. Biophys.
Res. Comm. 177:1161-64 (1991)). These data, together
with the recent determination of the UGT1 gene
structure, provide an understanding of the complex
arrangement of the locus and allow the prediction
that the other forms encoded within UGT1 are also
defective in the CN, Type I patient studied here,
including the isozymes encoded by UGT1BP, UGT1C, and
UGT1E.

Both Crigler-Najjar Syndrome, Type I, and
the hyperbilirubinemia seen in the Gunn rat are
transmitted as autosomal recessive disease entities
(Childs et al., Pediatrics 23:903 (1959); Szabo et
al., Acta Paediatr. Hung. 4:153 (1963); and Arias
et al., Am. J. Med. 47:395-409 (1969)). In humans,
CN, Type I is a rare disorder which was first
described in the literature in 1952 (Roy Chowdhury
et al., in The Metabolic Basis of Inherited Disease,
pp. 1367-1408 (1989)) and is often a consequence of
consanguineous parents. As for the case described
here, the heterozygosity and consanguinity of the
parents are the basis of the homozygosity of FB at
the UGT1*FB locus.

Using PCR, the present inventors
conclusively demonstrated that both parents are
carriers of the same defect seen in FB. Whereas
only a shortened band was amplifiable from the DNA
of FB, bands of both the deleted size and normal
size were amplifiable from that of the parents (Fig.
5). The PCR assay used herein may be utilized as
one in a series of diagnostic markers, among others
to emerge from future characterization of the
different defects associated with heritable
hyperbilirubinemic disorders (e.g., CN, Type II or
Gilbert's syndrome).

Although it is known that a 13-basepair
deletion accounts for the UGT1*FB CN, Type I case,
the existence of multiple phenotypes associated with
CN, Type I (Arias et al., Am. J. Med. 47:395-409
(1969); Robertson et al., J. Inher. Metab. Dis.
14:563-579 (1991)) supports the notion that multiple
types of mutations in UGT1 are involved. To date,
the present inventors have uncovered at least two
other types of alterations in CN, Type I patients:
a) a single nucleotide change in common exon 2 which
creates an amino acid change at residue 309, a
conserved region of the molecule, and b) a codon
deletion in exon 1 of UGT1A leading to the deletion
of residue 170. Furthermore, Van Es et al. (J.
Clin. Invest. 85:1199-1205 (1990))
performed a biochemical and immunochemical analysis
of four different (unrelated) CN patients with no
detectable transferase activity toward bilirubin.
Whereas patients A and B had reduced microsomal
transferase activity toward 5-hydroxytryptamine and
phenol, patients C and D had normal activities
toward these substrates. Furthermore, by using two
different monoclonal antibodies (WP1 and HEB7)
prepared against phenol and bilirubin UDP-
glucuronosyltransferase, Van Es et al. showed that
two or more immunostained isozymes were absent in
patients A and B, whereas in patients C and D, the
polypeptide patterns were similar to controls.
These results also suggest a genetic heterogeneity
among CN, Type I patients. Some are apparently able
to synthesize full length (but presumably inactive)
bilirubin transferases, whereas others (such as FB
described here) synthesize truncated versions. From
the results contained herein, one can predict that
microsomes from patient FB (who has not undergone
liver transplant surgery and therefore has not yet
been tested) will exhibit the phenotype which
characterized patients A and B. It also suggests
the possibility that among CN patients mutations are
possible which selectively inactivate one or both of
the bilirubin isoforms while sparing the remaining
forms encoded by the UGT1 locus (e.g., the UGT1F-
encoded phenol form). It is expected that analysis
of additional CN patients will uncover a host of
different mutations. The preliminary findings of a
different alteration in exon 2 of a second CN, Type
I patient and a codon deletion in exon 1 of UGT1A of
a third CN, Type I patient support the prediction of
the present inventors that many types of genetic
alterations account for the CN, Type I phenotype.

The above-described invention has many
significant uses. As noted above, the present
inventors have determined the sequence of all known
exons encoding bilirubin UDP-
glucuronosyltransferases within the approximately
90 kb span of the normal UGT1 gene complex. Thus,
the normal sequence data have allowed the inventors
to detect genetic alterations in the genomic DNA
from three different Crigler-Najjar, Type I
patients. FB has been described in detail in
Example IV, with confirmation that each parent's DNA
is heterozygous for the defect. Although two other
genetic alterations have been determined, it has not
been confirmed that either the single nucleotide
change leading to an amino acid change or the codon
deletion leading to an amino acid deletion generates
a loss of enzyme activity. With the use of pairs of
primers (at least 17 nucleotides in length) in the
UGT1 gene complex, one can amplify any target region
of the genome from normal and defective DNA by the
PCR methodology using both a Perkin-Elmer kit and a
thermocycler, for example, to generate sufficient
DNA for subcloning. Subclones will be sequenced by
the method described above to establish sequence
data for comparison to normal by using, for example,
the IBI Pustell sequence analysis software
package. Each defect will provide a site for a
diagnostic probe.

The above-described methodologies can be
applied to locate other altered nucleotide sequences
in the DNA from other patients with Crigler-Najjar,
Type I, Crigler-Najjar, Type II, and Gilbert's
syndromes to uncover all mutations or variations in
the nucleotide coding sequences or in flanking
regions of the UGT1 gene complex and other mammalian
bilirubin UDP-glucuronosyltransferase genes.

In order to establish a diagnostic probe,
sufficient quantities of the two types of DNA, as
well as samples of HUG-Br1 and HUG-Br2 cDNAs, can be
analyzed by Southern blot analysis (Jaye et al.,
Nucleic Acids Research 11:2325-35 (1983)). Each blot
may be made in duplicate. An approximately 18-base
oligonucleotide reflecting the altered sequence in
the Crigler-Najjar, Type I DNA, as well as an
approximately 18-base oligonucleotide reflecting
this sequence in normal DNA, can be made, and each
can be radiolabelled to at least 2 x 10* dpm/µg by
T4 polynucleotide kinase and/or terminal transferase
to be used as a probe. Preliminary studies can be
carried out to establish wash conditions for each
normal and altered sequence (see Sambrook et al., in
Molecular Cloning: A Laboratory Manual). Following
hybridization, high stringency wash conditions (as
established for the filters) can be carried out in
combination with the test sample of DNA present on
each filter. The validity of the diagnostic probe
can be established by the selective hybridization
(under high stringency wash conditions) of each
probe to target DNA containing a perfect match.

The PCR methodology could also be utilized
diagnostically in a slightly different manner. One
of a pair of normal primers (not altered to match
the defect) could be selected to include the site of
the defect in the genome. It would be predicted that
PCR product could be generated from the genome of
the normal individual and from that of both parents,
who are heterozygous for the defect, but not from
that of an individual homozygous for the defect. The
failure to generate product would be diagnostic for
a defect at the site of the specified primer.

Similarly, one can analyze genomic DNA
from patients with Crigler-Najjar, Type II and
Gilbert's syndromes for altered DNA sequence(s)
compared to normal. Any altered DNA sequence can
serve as a target for a diagnostic probe to be
compared to this sequence in normal DNA.

The present invention can also be used
with respect to gene therapy. For example, Crigler-
Najjar, Type I disease, a disease which is fatal for
most patients between the ages of 3 and 20, could
potentially be treated using the claimed invention.
(Liver transplantation has been the only effective
long-term treatment.) In particular, the present
invention provides two cDNAs which encode two
different bilirubin UDP-glucuronosyltransferases
which can be used as agents for gene therapy in
order to correct the defect in the expression of
this activity. Although gene therapy research is in
its infancy, progress is being made with animal
models. The protocols relevant for the bilirubin
transferase gene therapy model necessarily deal with
liver. Currently, amphotropic retroviral vectors
which have been engineered to be non-disease
producing are utilized as agents for the
transmission of foreign DNA into cultured primary
liver cells prepared from a portion of the liver
from an affected animal (individual). After the ex
vivo gene replacement, the cells are injected into
the spleen of the same animal. From this site, most
of the cells migrate to and remain in the liver.

The present invention can be illustrated 
by the use of the following non-limiting examples: 

Example I

Sequencing of CN, Type I Human Samples
The following protocol was utilized in
order to sequence the samples of interest:

CN, Type I human samples: Blood samples were
collected from FB, a 4 year old male CN, Type I
patient, his parents, and an unrelated normal
female, AR. FB, the offspring of consanguineous
parents, had average serum bilirubin concentrations
of 598 µmol/liter which are not affected by
phenobarbital therapy. Hemoglobin was 19.6 gm%, and
the reticulocyte count was 0.7%. The mother was O
negative and the baby was O positive; both direct
and indirect Coombs' tests were negative. The
results of the following studies were normal:
urinary menthol glucuronides following oral
administration (Szabo et al., Acta Paediatr. Hung.
4:153 (1963)), transaminases, alkaline phosphatases,
5'-nucleotidase, total bile acids, thyroid function
tests, PT, PTT, G6PD screen, pyruvate kinase screen,
α-antitrypsin levels, and various bacterial
cultures. Unconjugated hyperbilirubinemia persisted
despite repeated exchange transfusions and
phototherapy. A liver biopsy at 6 weeks of age
confirmed normal histology and the absence of
bilirubin UDP-glucuronosyltransferase activity.
Bilirubin glucuronides were absent in duodenal bile,
serum, and urine.

Preparation of lymphocyte genomic DNA:
The blood samples were transferred to
50-ml Falcon tubes and spun at 4000 rpm for 30 min
in a HS-4 rotor (Dupont-Sorvall). The pellet was
resuspended in 40 ml of prechilled RSB (10 mM
Tris-Cl, pH 7.5, 10 mM NaCl, 3 mM MgCl2, and 0.2%
Triton X-100), recentrifuged, and the step repeated
a second time. The pellet was resuspended the final
time in 2 ml of SDS solution (10 mM Tris-Cl, pH 7.5,
20 mM NaCl, 5 mM MgCl2, 2% SDS), Proteinase K (900
µg/10 ml volume blood) was added, and the tubes were
incubated at 37°C for 2 hr and then at room
temperature overnight. The samples were then
extracted five times with a 1:1 volume of
phenol:chloroform and dialyzed at 4°C overnight
against 10 mM Tris, pH 7.5, 1 mM EDTA with several
changes.

Oligonucleotide primer synthesis: Oligonucleotides
for PCR amplification of the four UGT1 common exons
[PXG3 and PXG4 for exon 2, PXG5 and PXG6 for exon 3,
PXG7 and PXG8 for exon 4, PXG9 and PXG10 for the
protein coding region of exon 5] were synthesized
using a Milligen/Biosearch Cyclone DNA synthesizer,
Model 8400 (Milligen/Biosearch, Bedford, MA) and a
200-coupling β-phosphoramidite kit
(Milligen/Biosearch). The oligonucleotides and
their sequences are as follows:
PXG3  5'-CTATCTCAAACACGCATGCC-3'
PXG4  5'-GGATTAGCGCTCCTGTGAA-3'
PXG5  5'-GTCTTTCTTTACGTTCTGCTC-3'
PXG6  5'-GACCCTGGTTTGACCTATAC-3'
PXG7  5'-CTCAGAGATGTAACTGCTGAC-3'
PXG8  5'-CATGAATGCCATGACCAAAG-3'
PXG9  5'-GTTCATACCACAGGTGTTCCA-3'
PXG10 5'-GGAAATGACTAGGGAATGGTTC-3'
J127  5'-TCTGAGACCATTGATCC-3'
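
Basic properties of the listed primers can be tabulated with a short Python sketch. The sequences are copied from the list above; the melting-temperature estimate uses the Wallace rule of thumb (2°C per A/T, 4°C per G/C), which is an assumption introduced here for illustration and is not a calculation described in the text.

```python
# Primer sequences (5'->3') as listed in the text.
PRIMERS = {
    "PXG3": "CTATCTCAAACACGCATGCC",
    "PXG4": "GGATTAGCGCTCCTGTGAA",
    "PXG5": "GTCTTTCTTTACGTTCTGCTC",
    "PXG6": "GACCCTGGTTTGACCTATAC",
    "PXG7": "CTCAGAGATGTAACTGCTGAC",
    "PXG8": "CATGAATGCCATGACCAAAG",
    "PXG9": "GTTCATACCACAGGTGTTCCA",
    "PXG10": "GGAAATGACTAGGGAATGGTTC",
    "J127": "TCTGAGACCATTGATCC",
}


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def wallace_tm(seq):
    """Rule-of-thumb melting temperature in degrees C: 2*(A+T) + 4*(G+C)."""
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


def summarize(primers):
    """Length, GC fraction, and Wallace-rule Tm for each primer."""
    return {
        name: {
            "length": len(seq),
            "gc": gc_fraction(seq),
            "tm_wallace": wallace_tm(seq),
        }
        for name, seq in primers.items()
    }


if __name__ == "__main__":
    for name, row in summarize(PRIMERS).items():
        print(f"{name:6s} {row['length']:2d} nt  "
              f"GC {row['gc']:.2f}  Tm ~{row['tm_wallace']} C")
```

All nine oligonucleotides are 17 nucleotides or longer, in keeping with the minimum primer length mentioned earlier in the specification. The Wallace estimate is crude and is shown only as a quick plausibility check.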

After synthesis, the oligos were cleaved and
deprotected with concentrated aqueous ammonia (29%)
as suggested by the manufacturer, dried and
resuspended in sterile water, re-precipitated with
ethanol, and resuspended in 10 mM Tris, pH 8.0, 1 mM
EDTA. Oligos were analyzed by electrophoresis
through a 12% polyacrylamide gel.

Polymerase chain reaction, subcloning, and
sequencing of PCR products: Primer combinations
PXG3 and PXG4, PXG5 and PXG6, PXG7 and PXG8, or PXG9
and PXG10 were used to amplify exons 2, 3, 4, or the
first 312 bp of exon 5, respectively (along with 30-
50 bases, on average, of flanking intron). Each PCR
reaction contained genomic DNA (0.1 µg), each of the
designated primer combinations (10 µM each), 10 mM
Tris-HCl, pH 8.3 (at 25°C), 50 mM KCl, 1.5 mM MgCl2,
0.001% (w/v) gelatin (Sigma, Cat. No. G2500, St.
Louis, MO), 0.2 mM each dNTP, and 2.5 units of
Amplitaq polymerase. Reaction mixes were overlaid
with mineral oil (50 µl) and subjected to thirty
cycles of 94°C for 1 min, 56°C for 1.5 min, and 72°C
for 3 min, using a Perkin-Elmer Cetus thermocycler
(Emeryville, CA). The final extension step at 72°C
was for 30 min. Using normal DNA, these primer
combinations result in the amplification of
fragments which are 311, 164, 309, and 398 bp in
length for exons 2 through 5, respectively.

PCR products were polished with Klenow,
phosphorylated with T4 polynucleotide kinase, and
subjected to electrophoresis through 1% low melting
point agarose. Bands were visualized with ethidium
bromide, excised, and blunt-end ligated with SmaI-
cleaved and calf intestinal alkaline phosphatase-
dephosphorylated pBluescript II SK+ (0.1 µg) using T4
DNA ligase (1 unit). After incubation at 16°C for
hr, the solidified agarose ligation reactions were
melted, diluted ten fold with RF2 buffer (Hanahan
et al., in DNA Cloning Volume I: A Practical
Approach, pp. 109-135 (1985)), and 50 µl was used to
transform 100 µl of frozen competent E. coli strain
XL1-Blue (Stratagene, La Jolla, CA) prepared using
Protocol #3 with the rubidium-containing buffers RF1
and RF2 (Hanahan et al., DNA Cloning Volume I: A
Practical Approach, pp. 109-135 (1985)).
Ampicillin-resistant transformants were cultured in
7 ml of LB Broth which contained ampicillin (100
µg/ml), and DNA was purified and sequenced as
previously described (Ritter et al., J. Biol. Chem.
265:7900-7906 (1990)). Each sample was sequenced in
its entirety in both directions using the Bluescript
primers KS and SK (Stratagene, La Jolla, CA). Two
or more of each subclone were sequenced.

Materials: Amplitaq polymerase was from
Cetus (Emeryville, CA), T4 polynucleotide kinase
from Bethesda Research Laboratories (BRL,
Gaithersburg, MD), and the Large Klenow fragment
from U.S. Biochemical (Cleveland, OH). All
restriction endonucleases were from either New
England Biolabs (Beverly, MA), Pharmacia-LKB
(Piscataway, NJ), or BRL. Calf intestinal alkaline
phosphatase was from Boehringer-Mannheim
(Indianapolis, IN). The vector pBluescript was
purchased from Stratagene.



Example II

Identification of a 13-Base Pair Deletion in Exon 2
Exons 2-5 (of Ex. I) were amplified
individually using the polymerase chain reaction and
subcloned as described in Example I. Analysis of
exon 3, 4, and 5 subclones showed that each had the
normal sequence (data not shown). However, a 13-bp
deletion (Fig. 4) was observed in each of four
independent exon 2 subclones. To confirm the
mutation, four additional exon 2 subclones from a
second PCR reaction were tested, indicating that
the mutation was not due to errors by Taq
polymerase. The deletion is located 12 bp
downstream from the intron 1/exon 2 junction (Fig.
4) and causes a change in the deduced amino acid
sequence after residue 293, as seen in Fig. 4A. The
dark vertical line by the nucleotides of the normal
DNA sequences represents the deleted bases in the
DNA of the Type I patient. The alterations suggest
there should be no gross effects on the level of
mRNA transcription, size, or translatability.

Example III
Determination of Whether More Than One UGT1
Locus Exists in the Genome
To establish whether more than one UGT1
locus exists in the genome, UGT1A-, UGT1D-, UGT1F-,
and common exon-probes were used to analyze a set of
Southern blots containing genomic DNA digested with
three different enzymes predicted not to cleave exon
1 of UGT1A, UGT1F, or exons 2, 3, 4, or 5 (the
common ones). A single band per enzyme digest for
each probe, except for UGT1D, indicates that a
single UGT1 complex exists in the genome (Fig. 6B).
Since the UGT1D-probe revealed 2 to 4 more bands
than predicted with EcoRI and BamHI digestions,
respectively (Fig. 6B), it is suggested that 2 to 4
more UGT1D-like exon 1s exist further upstream. The
UGT1 gene complex maps to chromosome 2 based on
studies to localize the gene which encodes the
HLUGP1 cDNA.

Example IV
Use of Agarose Gel Electrophoresis to
Compare CN, Type I Patient Sample with Samples
From Parents and That of a Normal Genome

Although it was established by a pedigree
study that the parents of FB (the patient in
question) are consanguineous, a demonstration of the
pattern of inheritance by FB of the defective allele
was carried out. Since CN, Type I is an autosomal
recessive disease, it appeared that FB was probably
homozygous for the same defective allele, UGT1*FB,
and both parents were probably heterozygous at the
UGT1*FB locus. This hypothesis was confirmed using
the polymerase chain reaction to amplify a fragment
corresponding to the last 45 bp of intron 1 and
first 154 bp of exon 2, followed by visualization of
the reaction products with agarose gel
electrophoresis (Fig. 5). Whereas a single band of
the expected normal size (199 bp) was observed in a
control sample from an unrelated normal individual
(Lane 1), the sample corresponding to FB (Lane 3)
contained a single band corresponding to a predicted
fragment shorter by 13 bp (186 bp). These
observations are consistent with FB being homozygous
for the allele with a 13-bp deletion. In
comparison, the mother (Lane 2) and father (Lane 4)
exhibited both the 199-bp and 186-bp bands in equal
amounts. Two other slower migrating bands not seen
in either the normal or CN, Type I case were present
in the parental samples. A sample representing a
reaction which included a mixture of FB and normal
DNA (Fig. 5, Lane 5), thereby simulating the
composition of heterozygous DNA, also exhibited
bands of these sizes. These observations are
consistent with those from other PCR studies (Keen
et al., Trends Genet. 7:5 (1991)) with multiple
alleles, which suggest that the bands are due to
mismatched DNA strands (i.e., heteroduplexes) which
migrate differently from those which are perfectly
matched.
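
The band sizes expected for each genotype follow from simple arithmetic, sketched below in Python. The fragment lengths and lane assignments are taken from the text; the genotype labels and function name are illustrative only, and the slower-migrating heteroduplex bands are not modeled.

```python
# Amplified region: last 45 bp of intron 1 plus first 154 bp of exon 2.
INTRON_1_BP = 45
EXON_2_BP = 154
DELETION_BP = 13

NORMAL_BAND = INTRON_1_BP + EXON_2_BP      # product from a normal allele
DELETED_BAND = NORMAL_BAND - DELETION_BP   # product from the UGT1*FB allele

# Illustrative genotype labels mapped to the homoduplex bands expected.
EXPECTED = {
    "normal/normal": [NORMAL_BAND],
    "normal/deleted": [NORMAL_BAND, DELETED_BAND],
    "deleted/deleted": [DELETED_BAND],
}

# Lane assignments as described for Fig. 5.
LANES = {
    1: ("unrelated normal individual", "normal/normal"),
    2: ("mother", "normal/deleted"),
    3: ("FB", "deleted/deleted"),
    4: ("father", "normal/deleted"),
    5: ("mixture of FB and normal DNA", "normal/deleted"),
}


def expected_bands(genotype):
    """Return the homoduplex band sizes (bp) expected for a genotype."""
    return EXPECTED[genotype]


if __name__ == "__main__":
    for lane, (sample, genotype) in LANES.items():
        print(f"Lane {lane}: {sample}: {expected_bands(genotype)} bp")
```

A 13-bp difference between 199-bp and 186-bp products is large enough to resolve on an agarose gel, which is what allows homozygous and heterozygous samples to be distinguished by band pattern alone.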

Example V
Preparation of HUG-Br1 and HUG-Br2
Poly (A) 'RTTA was prepared from human adult 
liver biopsy and converted to double-stranded cDNA 
using the Boehringer Mannheim cDNA synthesis kit and 
oligo (dT) as primer. The cDNA was ligated with 
EcoRI-Sall adaptors and cloned into the EcoRI site 
of lambda-ZAP bacteriophage (Stratagene, San Diego, 
CA) . Unamplified phage (0.5 X 10* pfu) were screened 
with a 51-bp probe. In order to design a universal 
probe to detect all transferase cDNAs including that 
which is specific for bilirubin, all characterized 
(Mackenzie et al. , J. Biol. Ch^m. 261, 6119-6125 
(1986); Mackenzie et al. , J. Biol. Chem. 261, 
14112-14117 (1986); Harding et al,, Proc.-Natl- 
Acad. Sci. U.S.A., 85, 8381-8385 (1988); lyanagi et 

J> Biol. Chem. 261, 15607-15614 (1986); Jaye et 
Nucleic Acids ReseaT-r^h 2325-2335 (1983); 

Ritter et al, , J. Biol. Chem . 265, 7900-7906 (1990)) 
transferase clones were analyzed for conserved 
sequences. It was uncovered that a 56-bp sequence 
is present near the carboxy terminus of all 
transferases examined. A 51-bp probe (5'-CAC GAC
CTC ACC TGG TTC CAG TAC CAC TCT TTG GAT GTG ATT GGT
TTC CTG-3') corresponding to a 90% conserved motif
(His-Asp-Leu-Thr-Trp-Phe-Gln-Tyr-His-Ser-Leu-Asp-
Val-Ile-Gly-Phe-Leu) was used. This 17-amino acid
sequence starting with a histidine residue at
approximately position 481 is located in the lumen
of the endoplasmic reticulum and overlaps the
membrane-spanning region by 6 residues. It is
speculated that this sequence, preceded by a
conserved arginine at the -4 residue, is a possible
binding site for the common donor substrate,
UDP-glucuronic acid. The probe was labeled to a
specific activity of 0.22 X 10' dpm/ug by fill-in of
overlapping sense and antisense oligonucleotides.
The probe was hybridized as described (Jaye et al.,
Nucleic Acids Research 11, 2325-2335 (1983)) except
that 45 and 30°C were the hybridization and wash
temperatures, respectively. All manipulations of
lambda-ZAP and excision of the Bluescript plasmid
containing positive inserts were performed as
recommended by the manufacturer. Four clones
representing independent copies of the HUG-Br1 cDNA
and one clone of HUG-Br2 were isolated and
characterized. The complete nucleotide sequences of
both strands of Z17 and the longest of the HUG-Br1
inserts, Z6, were determined by double-stranded
dideoxy sequencing in conjunction with deletions as
described previously (Ritter et al., J. Biol. Chem.
265, 7900-7906 (1990)).
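
The relationship between the 51-bp probe and the 17-residue motif can be checked by translating the probe with the standard genetic code. The sketch below is illustrative only. It assumes the eighth codon reads TAC (Tyr), as the stated peptide requires, and the helper names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Probe as given in the text, assuming TAC (Tyr) for the eighth codon.
PROBE = (
    "CAC GAC CTC ACC TGG TTC CAG TAC CAC TCT TTG GAT GTG ATT GGT TTC CTG"
).replace(" ", "")

# One-letter form of His-Asp-Leu-Thr-Trp-Phe-Gln-Tyr-His-Ser-Leu-Asp-
# Val-Ile-Gly-Phe-Leu.
MOTIF = "HDLTWFQYHSLDVIGFL"


def translate(dna):
    """Translate a DNA sequence in frame 1 using the standard code."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


if __name__ == "__main__":
    print(len(PROBE), "bp ->", translate(PROBE))
    print("matches motif:", translate(PROBE) == MOTIF)
```

A TAG codon at the eighth position would instead translate as a stop, so the check also serves as a quick guard against transcription errors in the probe sequence.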

Complementary DNAs containing the
remaining portion of the Z6 and Z17 coding and 5'
untranslated regions were generated using the
anchored PCR (RACE) method of Frohman et al. (PCR
Protocols: A Guide to Methods and Applications pp.
28-38, Academic Press, New York (1990)). Antisense
oligonucleotides (10 picomoles) corresponding to
base positions +206 to +227 of the Z17-cDNA (5'-TGT
GCC CTT AAA GTC TAA GTC A-3') or +235 to +251 (5'-
CTG CAA GGA AGG AAA GG-3') of the Z6-cDNA inserts,
respectively, were used to prime first-strand cDNA
synthesis. After a tailing reaction with terminal
deoxynucleotidyl transferase and dATP, the products
were used as templates in separate PCR reactions
with the adaptor-poly(dT) (Frohman et al., PCR
Protocols: A Guide to Methods and Applications pp.
28-38, Academic Press, New York (1990)) and a nested
clone-specific oligonucleotide as primers. Major
bands of approximately 470-bp (Z6-extended template)
and 430-bp (Z17-extended template) were kinased and
cloned into the EcoRV site of Bluescript SK-
(Stratagene, San Diego, CA), and individual clones
were sequenced. Clones produced from duplicate PCR
reactions had identical nucleic acid sequences,
suggesting that mutations were not introduced by
Taq polymerase during amplification, nor were forms
with similar primer sequences co-amplified. The
longest clones contained 449- and 399-bp inserts,
respectively. The fragments extended Z6 by 234-bp
and Z17 by 357-bp to complete the coding region with
15- and 29-bp, respectively, of 5'-untranslated
sequence. Complete coding versions of the two new
cDNAs, HUG-Br1 (original clone Z6) and HUG-Br2 (Z17),
were assembled by ligating Z6 and Z17 to their
respective 5'-ends by using a unique restriction site
near the 5'-terminus of each clone, AvaI (+298 of
HUG-Br1) and XbaI (+360 of HUG-Br2). In both cases,
the nucleotide sequence across the junction site was
verified by dideoxy sequencing. Both clones were
judged to contain their bona fide 5' ends by virtue
of the fact that a DNA fragment of the predicted
size was successfully amplified from human liver
cDNA using primers located at the 5' and 3' ends of
the complete coding cDNAs. The nucleotide sequences
of HUG-Br1 and HUG-Br2 have been submitted to the
GenBank database. Further, E. coli transformed with
the clones pHUG-Br1 and pHUG-Br2 were deposited in
the American Type Culture Collection in Rockville,
Maryland, U.S.A., under the Budapest Treaty on
January 9, 1991. The clone pHUG-Br1 (deposited on
January 9, 1991 under the name "E. coli derivative,
XL-1 Blue transformed with pBluescriptSK+ HUG-Br1")
has been assigned accession number 68510. The clone
pHUG-Br2 (deposited on January 9, 1991 under the
name "E. coli derivative XL-1 Blue transformed with
pBluescriptSK+ HUG-Br2") has been assigned accession
number 68509.
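
As a consistency check on the two antisense RACE primers described in this Example, the length of each oligonucleotide can be compared with the base positions it is said to cover, and the sense-strand target can be derived by reverse complementation. This is a minimal sketch; the dictionary layout and function names are assumptions made for illustration.

```python
# Primer sequences and coordinates are from the text (antisense, 5'->3').
RACE_PRIMERS = {
    "Z17": ("TGTGCCCTTAAAGTCTAAGTCA", 206, 227),
    "Z6": ("CTGCAAGGAAGGAAAGG", 235, 251),
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def span_length(start, end):
    """Number of bases covered by inclusive positions start..end."""
    return end - start + 1


if __name__ == "__main__":
    for clone, (primer, start, end) in RACE_PRIMERS.items():
        consistent = len(primer) == span_length(start, end)
        print(clone, len(primer), "nt, coordinates consistent:", consistent)
        print("  sense-strand target 5'->3':", reverse_complement(primer))
```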

Example VI 
Northern Blot Analysis 
Poly(A)+ RNA was isolated (Chirgwin et al.,
Biochemistry 18, 5294-5299 (1979)) from two normal
liver biopsies (A.K. and W.E., adult females) and
from the liver of a Type I Crigler-Najjar syndrome
patient (female, age 3) who underwent transplant
surgery (Kaufman et al., Hepatology 6: 1259-1262
(1986)). mRNA was also isolated from the livers of
an untreated and a phenobarbital-treated
Erythrocebus patas monkey, the latter maintained on
15 mg % phenobarbital in its drinking water for over
one year. The mRNA was
electrophoresed and blotted as described (Ritter, et
al., J. Biol. Chem., 265: 7900-7906 (1990)). The
filters were hybridized with 32P-labeled probes
corresponding to the most 5' 725-bp EcoRI fragment
of HUG-Br1 and the most 5' 400-bp fragment of HUG-
Br2 (Church, et al., Proc. Natl. Acad. Sci. U.S.A.
81:1991-1995 (1984)). The amounts of RNA applied to
the gel were normalized (McKinnon, et al., Mol. Cell.
Biol. 7: 2148-2154 (1987)) by hybridization to 32P-
labeled human cyclophilin cDNA. All probes were
radiolabeled by random primed synthesis using a kit
(Pharmacia).

From this hybridization with 5'-specific
probes from HUG-Br1 and HUG-Br2 to Northern filters
containing mRNA isolated from two normal individuals
and a Type I Crigler-Najjar patient, it can be seen
in Figure 10 (Panel A) that the size of both
messages is 2.6 kb. The relative abundance of the
HUG-Br1 message is 3.0-fold higher than that for
HUG-Br2. This determination is based on the amount
of mRNA applied as normalized by hybridization to
the 32P-labeled cyclophilin cDNA probe and the
difference in the length of the 5'-specific probes.
By an analogous determination for the Crigler-Najjar
patient, the data show that the HUG-Br2 message
level is approximately normal but that for HUG-Br1
is depressed 4-fold. The fact that both mRNAs
encoding HUG-Br1 and HUG-Br2 (although reduced in
the case of HUG-Br1) are present in the Type I
Crigler-Najjar patient and yet activity remains
refractory to all treatments suggests that a subtle
defect (e.g., point mutation, small insertion, or
deletion) in the common region of a possibly shared
exon or a highly mutable region of a multigene model
underlies the defect in Type I Crigler-Najjar.
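
The normalization described in this paragraph (correcting each band for the cyclophilin loading control and for probe length) can be written out as a short calculation. The sketch below assumes that signal scales linearly with probe length for random-primed probes, which is an assumption made here rather than something stated in the text. The probe lengths come from the Example, and every signal value is a hypothetical densitometry reading chosen only for illustration.

```python
# Probe lengths (bp) are from the text; all signal values used below are
# hypothetical readings, not data from Figure 10.
PROBE_LENGTH_BP = {"HUG-Br1": 725, "HUG-Br2": 400}


def normalized_abundance(band_signal, cyclophilin_signal, probe_length_bp):
    """Band signal corrected for RNA loading and for probe length."""
    if cyclophilin_signal <= 0 or probe_length_bp <= 0:
        raise ValueError("loading control and probe length must be positive")
    return band_signal / cyclophilin_signal / probe_length_bp


def fold_difference(numerator, denominator):
    """Ratio of two normalized abundances."""
    return numerator / denominator


if __name__ == "__main__":
    br1 = normalized_abundance(5437.5, 1000.0, PROBE_LENGTH_BP["HUG-Br1"])
    br2 = normalized_abundance(1000.0, 1000.0, PROBE_LENGTH_BP["HUG-Br2"])
    print(f"HUG-Br1 / HUG-Br2 = {fold_difference(br1, br2):.1f}-fold")
```

Dividing by probe length matters here because the HUG-Br1 probe is nearly twice as long as the HUG-Br2 probe, so raw band intensities alone would overstate the difference between the two messages.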

To determine if either species of mRNA is
regulated by phenobarbital, the present inventors
compared the levels in the liver of an untreated Old
World monkey with those from one maintained on
phenobarbital in the drinking water. The results show
that the mRNA for HUG-Br2, the lower abundance mRNA,
was induced 2-fold but that for HUG-Br1 was not
(Fig. 10, Panel B), based on normalization to
cyclophilin mRNA. The existence of two bilirubin
transferase isozymes and the responsiveness of one
to phenobarbital administration suggest that
elevation in activity after this treatment is due to
the increased expression of the low constitutive
isoenzyme encoded by HUG-Br2. The results also
suggest that the higher abundance mRNA encoding HUG-
Br1 is most likely responsible for constitutive
bilirubin activity and that this version may be
underrepresented in both Gilbert's syndrome and in
Type II Crigler-Najjar disease. By analogy, HUG-
Br2-associated activity necessarily accounts for the
elevation in activity after phenobarbital
administration.

Example VII
Assay for Glucuronidation of 14C-Bilirubin
For the transfection studies the
expression units, pHUG-Br1 and pHUG-Br2, contained
HUG-Br1 and HUG-Br2, respectively, in the correct
orientation with respect to the promoter element of
the pSVL plasmid, and a control unit contained HUG-
Br1 in the reversed orientation. Plasmid
preparations and transfection studies were performed
as previously described (Ritter, et al., J. Biol.
Chem. 265: 7900-7906 (1990)). Cells were incubated
72 hours at 37°C, harvested, and stored at -70°C or
assayed fresh.

Microsomes (2-9 mg) from 3-methylcholanthrene-
treated CD1 mice and cell homogenates (0.8 or
1.9 mg) from transfected cells were assayed in a
200-uL reaction mixture containing 50 mM Tris pH 7.7,
5 mM MgCl2, 20 mM UDP-glucuronic acid, 8.3 mM
saccharic 1,4-lactone, and 25 uM 14C-bilirubin
(74,000 CPM, solubilized in 10 uL of
dimethylsulfoxide) and incubated for 6.5 hours at
37°C. The sensitivity of the reaction product to β-
glucuronidase (where saccharic 1,4-lactone had been
omitted) was then determined by adjusting the
reaction to pH 7.0 with 1 M Na2HPO4, and adding 5
units of enzyme plus 3 WL ascorbic acid and
continuing the incubation for 1 hour at 37°C.
Microsomes and cell homogenates were preactivated
with 0.5 mg of [3-(3-cholamidopropyl)-
dimethylammonio]-1-propanesulfonate (CHAPS) detergent
per 1 mg protein before adding to the reaction
mixture. All reagents used in the assay were made
fresh in argonized double-distilled water, and all
incubations were flushed with argon, sealed and
incubated in the dark. In order to generate a more
stable product for thin layer chromatography (TLC)
analysis and to distinguish the forms of
glucuronides, the 14C-bilirubin mono- and
diconjugates were selectively converted to the
corresponding mono- and dimethyl esters by using
alkali-catalyzed transesterification in the presence
of methanol and ascorbic acid as published
(Blanckaert, N. (1980) Biochem. J. 185: 115-128;
Hauser, et al., J. Biol. Chem. 259: 4527-4533
(1984)). Enriched bile was added as a carrier
before the transesterification to confirm the
location of radioactive product. The methyl esters
of bilirubin corresponding to the IXαC8 and the
IXαC12 monoglucuronides and the diglucuronide of
bilirubin were extracted and chromatographed on TLC
plates as described (Blanckaert, N. (1980) Biochem.
J. 185: 115-128; Hauser, et al., J. Biol. Chem.
259: 4527-4533 (1984)). Plates were analyzed for
activity on an Ambis Radioanalytical Imaging Mark II
System before exposure to x-ray film.
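
The quantities in the reaction mixture lend themselves to a short worked calculation: the amount of 14C-bilirubin per 200-uL reaction, the specific radioactivity implied by the stated 74,000 CPM, and the CHAPS needed for a given amount of protein. The sketch below uses only the figures given above. The function names are hypothetical, and the conversion of band counts to picomoles assumes that counting efficiency is the same for substrate and product, which is an assumption made here for illustration.

```python
REACTION_VOLUME_UL = 200.0     # from the text
BILIRUBIN_UM = 25.0            # from the text
BILIRUBIN_CPM = 74_000.0       # from the text
CHAPS_MG_PER_MG_PROTEIN = 0.5  # from the text


def nmol_in_reaction(conc_um, volume_ul):
    """uM x uL gives pmol; divide by 1000 to get nmol."""
    return conc_um * volume_ul / 1000.0


def cpm_per_nmol(total_cpm, total_nmol):
    """Specific radioactivity of the substrate in the reaction."""
    return total_cpm / total_nmol


def chaps_mg(protein_mg):
    """CHAPS needed to preactivate a given amount of protein."""
    return CHAPS_MG_PER_MG_PROTEIN * protein_mg


def product_pmol(band_cpm, specific_cpm_per_nmol):
    """Convert counts in a TLC band to pmol of bilirubin equivalents."""
    return band_cpm / specific_cpm_per_nmol * 1000.0


if __name__ == "__main__":
    substrate = nmol_in_reaction(BILIRUBIN_UM, REACTION_VOLUME_UL)
    specific = cpm_per_nmol(BILIRUBIN_CPM, substrate)
    print(f"bilirubin per reaction: {substrate} nmol")
    print(f"specific radioactivity: {specific} cpm/nmol")
    print(f"CHAPS for 1.9 mg homogenate: {chaps_mg(1.9)} mg")
```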
The assay for glucuronidation of 14C-
bilirubin by mouse microsomes was carried out to
establish the validity of the method as shown in
Figure 11 (Panel A). Lanes 3 and 4 show,
respectively, that the three bands attributed to
IXαC8 and IXαC12 isomeric monoglucuronides (BMEs)
and the diglucuronide (BDE) were present in a
complete reaction mixture but were totally absent
when UDP-glucuronic acid was omitted. The relative
mobility of bilirubin and its methyl ester
derivatives in this system is Br > IXαC8 > IXαC12 >
IXαC8, C12 according to the published method
(Blanckaert, N. (1980) Biochem. J. 185: 115-128).
The decrease (40%, as estimated by quantitation on
AMBIS) in intensities of specific bands in Fig. 11
(Panel A) Lane 2 shows that the product was
sensitive to β-glucuronidase after one hour
incubation. The band intensities also show that
bilirubin diglucuronide was the primary conjugate in
mouse microsomes. Similarly, Lane 1 in Figure 11
(Panel A) shows that HUG-Br2-encoded activity in
1.9 mg of cell homogenate yielded the same three
converted bilirubin methyl ester isomeric products
in about equal amounts. In a separate experiment
(Figure 11 (Panel B)), HUG-Br1-encoded activity in
0.8 mg of cell homogenate also yielded the converted
methyl ester derivatives in about equal amounts
(Lane 2), and the glucuronide products, before
conversion to the methyl esters, were sensitive to
β-glucuronidase (Lane 1). Cells mock-transfected
with HUG-Br1 contained in pSVL in the reversed
orientation with respect to promoter did not yield
(Lane 3) methyl ester derivatives as seen in Figure
11 (Panel A) (Lanes 1, 2 and 3) and Figure 11 (Panel
B) (Lane 2).

Example VIII
Sequence Homology
A comparison of the amino acid sequences
of various UDP-glucuronosyltransferase forms: rat
phenol UDP-glucuronosyltransferase (Rat PHENOL),
human phenol UDP-glucuronosyltransferase (HLUGP1),
human bilirubin UDP-glucuronosyltransferase, type I
(HUG-Br1), human bilirubin UDP-glucuronosyl-
transferase, type II (HUG-Br2), and rat bilirubin
UDP-glucuronosyltransferase (Rat Bil), as set forth
in Figure 12, shows that the carboxyl terminus of
the various UDP-glucuronosyltransferase forms is
conserved as between the human and the rat species
and the phenol and bilirubin acceptor substrate
forms. The human forms, HLUGP1, HUG-Br1 and
HUG-Br2, show exact homology between amino acids
283-531, 285-533, and 286-534, respectively. The
rat forms, Rat PHENOL and Rat Bil, show exact
homology between amino acids 281-529 and 283-531,
respectively. Further, there is a 90.6% conserved
amino acid sequence between the two species.
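
Two simple checks make these comparisons concrete: the quoted residue ranges should all span the same number of residues if they describe one shared carboxyl-terminal region, and a percent-identity figure can be computed position by position over an alignment. The sketch below is illustrative only. It takes the HUG-Br2 range to begin at residue 286 (an assumption that gives all five regions the same length), the toy sequences are hypothetical, and the identity function expects sequences that have already been aligned to equal length.

```python
# Residue ranges of the conserved carboxyl-terminal region. The HUG-Br2
# start of 286 is an assumption made so that all regions are equal in length.
CONSERVED_REGION = {
    "Rat PHENOL": (281, 529),
    "HLUGP1": (283, 531),
    "HUG-Br1": (285, 533),
    "HUG-Br2": (286, 534),
    "Rat Bil": (283, 531),
}


def region_length(start, end):
    """Number of residues in the inclusive range start..end."""
    return end - start + 1


def percent_identity(seq_a, seq_b):
    """Percent of identical positions between two pre-aligned sequences."""
    if len(seq_a) != len(seq_b) or not seq_a:
        raise ValueError("sequences must be aligned to the same non-zero length")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / len(seq_a)


if __name__ == "__main__":
    for name, (start, end) in CONSERVED_REGION.items():
        print(name, region_length(start, end), "residues")
    # Hypothetical ten-residue fragments differing at one position.
    print(percent_identity("HDLTWFQYHS", "HDLTWFQYHA"), "% identity")
```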

In the nonconserved amino terminal regions
of Rat PHENOL, HLUGP1, HUG-Br1, HUG-Br2 and Rat Bil,
corresponding to amino acids 1-280, 1-282, 1-284, 1-
285, and 1-282, respectively, there is 66% homology
between HUG-Br1 and HUG-Br2, 70% homology between
HUG-Br1 and Rat Bil, 76% homology between HUG-Br2
and Rat Bil, 84% homology between Rat PHENOL and
HLUGP1, and 60% homology between Rat Bil and
HLUGP1. Comparisons of the rat bilirubin and phenol
transferase amino acid sequences support an
association between the amino terminal sequences and
substrate specificity. This region in rat bilirubin
transferase is more similar to both of the two human
bilirubin transferases (76 and 70%) than to human
phenol transferase (60%). Similarly, the unique
part of rat phenol transferase is much more highly
related to the corresponding part of human phenol
transferase (84%) than to any of the three
bilirubin sequences (60-65%). The strict
conservation shown in the carboxyl termini in either
species suggests that exon sharing within a single
gene may occur (alternative 5' splicing, as suggested
by Sato, et al., Biochem. Biophys. Res. Commun. 169,
260-264 (1990)) to generate different isozymes in
this transferase subfamily.

Example IX

Identification of HUG-Br3 from a Human Genomic
Library Using a Probe Derived from HUG-Br2

The 470-bp 5' end of HUG-Br1 and the 450-
bp 5' end of HUG-Br2 were used as probes to screen a
human genomic cosmid library. One clone
(approximately 40 kb) was selected which hybridized
avidly to HUG-Br1 and very weakly to HUG-Br2,
together with 2 other clones (45 and 44 kb) which
hybridized to the unique region of HUG-Br2 and not
to the unique region of HUG-Br1 or to the common
fragment of HUG-Br1 and HUG-Br2 coding for amino
acids 292-533. The
three clones were determined by restriction enzyme
analysis to overlap to generate a linear consensus
fragment (97 kb) (see Figures 13A and 13B). Eight
subclones (3-14 kb) were generated by digestion of
the two 5' cosmids with restriction enzymes as
indicated in Figs. 13A and 13B and ligating each
fragment into similarly digested Bluescript plasmid.
E. coli were transformed separately with each
recombinant plasmid and plated on LB plates with
ampicillin. Each transformed colony was grown in 10
ml of LB broth with ampicillin. Plasmid samples
were prepared according to Ritter, et al., J. Biol.
Chem. 265: 7900-7906 (1990). Plasmid DNA samples
were analyzed by Southern blot analysis according to
Sambrook, Fritsch and Maniatis, Molecular Cloning:
A Laboratory Manual, Second Edition. Each one
of the eight subclones was probed with the three
fragments of HUG-Br2 (respectively, 1-450 bp; 454-
666 bp; and 666-863 bp fragments representing the
unique end of HUG-Br2). Four Southern blots, each
containing DNA from each of the eight Bluescript
subclones, were made. Each of the HUG-Br2 fragments
(defined above) was separately hybridized to one of
the Southern blots and the fourth blot was
hybridized to full-length HUG-Br1 to confirm that
only the HUG-Br2 (1-863-bp) region hybridized to
these subclones. Five separate clones were
identified (1, 4, 6, 7, and 8), which hybridized to
the 3 separate fragments of HUG-Br2. Other
subclones were all introns. Each positive subclone
was further mapped in detail by restriction enzymes.
Each subclone was digested with two to three sets of
two enzymes each and again analyzed by Southern blot
analysis as already described.
Three sets of Southern blots were made
with each containing the sets of double-digested
samples. The three unique fragments of HUG-Br2 were
each again hybridized to a separate blot. The now
smaller fragment (3-5 kb) which contained the intact
exon was subcloned into Bluescript as described
above. Each subclone was transformed into E. coli
and plasmid samples prepared as described. The
recombinant plasmid was sequenced by making nested
deletions with an Erase-a-Base kit (Promega,
Madison, WI). Direct plasmid dideoxy sequencing was
carried out on the deleted clones using the Sequel
Ds/KF sequencing kit (IBI, New Haven, CT) according
to the manufacturer's protocol and as described in
the reference, Ritter, et al., J. Biol. Chem. 265:
7900-7906 (1990). The alignment of the HUG-Br3
sequence with the corresponding region of HUG-Br2
was carried out by the IBI Pustell sequence analysis
software package. The preliminary data showing a
high degree (> 90%) of homology between HUG-Br2 and
this short fragment of HUG-Br3 suggest that HUG-Br3
encodes a form of bilirubin UDP-glucurono-
syltransferase.

One skilled in the art would recognize 
that the above-described methodology can be applied 
to uncover all mammalian bilirubin UDP-glucurono- 
syltransferases by optimizing hybridization 
stringency conditions and DNA probe length by 
routine techniques known to one skilled in the art. 

Example X 
Diagnostic Probes 
The determination of the organization and
the nucleotide sequence of the HUG-Br1 and HUG-Br2
gene complex has been completed using cosmid clones
constructed from the genomic DNA of an individual
not afflicted with a defect in bilirubin
glucuronidation. This nucleotide sequence
information is considered to represent the normal
genes. HUG-Br1, HUG-Br2, and the phenol (HLUGP1)
clones have been isolated from a cDNA library
constructed with mRNA isolated from the explanted
liver of a Crigler-Najjar Type I patient who
underwent liver transplant surgery by using the
unique region of each clone as a probe. The full-
length cDNAs were sequenced using the direct plasmid
method already described. The nucleotide sequences
of the clones derived from the Crigler-Najjar Type I
patient were compared to the sequences of the normal
HUG-Br1, HUG-Br2 and HLUGP1 to determine if there
are altered bases, deletions or insertions. The
comparisons were made using the IBI Pustell Sequence
software computer package.
The location of a modified nucleotide
sequence was assessed at the level of the gene in
the same patient's genomic DNA. Opposite strand
primers (5'-3') were synthesized with an 18-base
complementary sequence to HUG-Br1, HUG-Br2, and
HLUGP1 at positions 100 bases upstream and 100 bases
downstream of the altered nucleotides. Using the
specific primers, sufficient quantities of the
target DNA from normal genome and from the defective
genome were generated by PCR using a Perkin-Elmer
Kit (Norwalk, CT). The two types of amplified DNA
as well as samples of the HUG-Br1, HUG-Br2 and
HLUGP1 cDNA previously isolated above from the
Crigler-Najjar Type I cDNA library and the normal
HUG-Br1 and HUG-Br2 cDNA clones (as heretofore
described) were analyzed by Southern blot analysis.
Each blot was made in duplicate. An 18-base oligo
reflecting the altered DNA sequence(s) in the
Crigler-Najjar Type I DNA, as well as an 18-base
oligo reflecting this sequence in DNA from normal
genome were made. Each 18-base oligonucleotide
probe was hybridized to the Southern blot prepared
above using high stringency conditions described in
Sambrook, Fritsch and Maniatis, Molecular Cloning: A
Laboratory Manual. One skilled in the art would
recognize that the normal 18-base probe will
hybridize only to the normal cDNAs and the normal
PCR amplified 200-base target sequence, whereas the
altered 18-base probe will hybridize only to the
mutated cDNAs and the mutated PCR amplified 200-base
target sequence.

Genomic DNA from patients with Gilbert's
disease and Crigler-Najjar Type II syndrome can be
analyzed for altered DNA sequence(s) compared to
normal. The DNA will be analyzed for alterations by
first making PCR products to exonic regions of the
HUG-Br1 and HUG-Br2 and other bilirubin genes as yet
to be determined (e.g., HUG-Br3). Plasmid subclones
of the PCR products will be made and sequenced.
Upon making comparisons to normal DNA any altered
sequences will be subjected to the same analyses
already described for Crigler-Najjar Type I as
above.

The validity of the diagnostic probe is
established by the selective hybridization (under
high stringency conditions) of each probe to target
DNA containing a perfect match.
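
The logic of a pair of allele-specific 18-base probes can be illustrated with a small model in which high-stringency hybridization is approximated as a requirement for a perfect 18-of-18 match on either strand. This is only a sketch under that simplifying assumption. The sequences below are invented for illustration and are not taken from the UGT1 locus, and all function names are hypothetical.

```python
# Hypothetical sequences (NOT from the UGT1 locus), used only to show the
# perfect-match logic for a 13-bp deletion.
LEFT = "ATGGCTAGCTTGACCGTAAC"
DELETED = "GGATCCTTGAGCA"          # 13 bases removed in the mutant allele
RIGHT = "TCGGTACCAAGTCGATGCAA"

NORMAL_TARGET = LEFT + DELETED + RIGHT
MUTANT_TARGET = LEFT + RIGHT

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def probe_around(seq, center, length=18):
    """Cut a probe of the given length centered on position `center`."""
    start = center - length // 2
    return seq[start:start + length]


def hybridizes(probe, target):
    """Model high stringency as an exact match to either target strand."""
    return probe in target or reverse_complement(probe) in target


# Normal probe overlaps the deleted bases; mutant probe spans the junction.
NORMAL_PROBE = probe_around(NORMAL_TARGET, len(LEFT) + len(DELETED) // 2)
MUTANT_PROBE = probe_around(MUTANT_TARGET, len(LEFT))

if __name__ == "__main__":
    for name, probe in (("normal", NORMAL_PROBE), ("mutant", MUTANT_PROBE)):
        print(name, "probe:",
              "normal target", hybridizes(probe, NORMAL_TARGET),
              "| mutant target", hybridizes(probe, MUTANT_TARGET))
```

In this model each probe recognizes only the allele it was designed against, which is the selective behavior on which the validity of the diagnostic probes rests.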

Whereas the clones according to the
invention have been utilized as diagnostic probes or
for use in a diagnostic assay as described above, said
clones would also have utility in gene therapy
techniques for Crigler-Najjar, Type I disease.

***** 

While the invention has been described 
with reference to certain specific embodiments, it 
will be appreciated that many modifications and 
changes may be made by those skilled in the art 
without departing from the spirit of the invention. 
It is intended, therefore, by the appended claims, 
to cover all such modifications and changes as fall 
within the true scope of the invention.

CLAIMS:

1) An isolated gene locus, referred to
as UGT1, comprising the nucleotide sequence shown in
Figure 1.

2) A DNA segment selected from the group
consisting of exon 1 of UGT1A, UGT1BP, UGT1C, UGT1D,
UGT1E, and UGT1F.

3) A recombinant DNA molecule comprising
a vector and said DNA segment of claim 2 referred to
as exon 1 of UGT1A.

4) A recombinant DNA molecule comprising
a vector and said DNA segment of claim 2 referred to
as exon 1 of UGT1BP.

5) A recombinant DNA molecule comprising
a vector and said DNA segment of claim 2 referred to
as exon 1 of UGT1C.

6) A recombinant DNA molecule comprising
a vector and said DNA segment of claim 2 referred to
as exon 1 of UGT1D.

7) A recombinant DNA molecule comprising
a vector and said DNA segment of claim 2 referred to
as exon 1 of UGT1E.

8) A recombinant DNA molecule comprising
a vector and said DNA segment of claim 2 referred to
as exon 1 of UGT1F.

9) A DNA segment that codes for a
polypeptide having an amino acid sequence
corresponding to a mammalian UDP-glucurono-
syltransferase form or a mutant or variant form
thereof.

10) A recombinant molecule comprising a
vector and said DNA segment of claim 9.

11) A DNA segment that codes for a
polypeptide having an amino acid sequence
corresponding to human bilirubin UDP-
glucuronosyltransferase type I.

12) A DNA segment that codes for a
polypeptide having an amino acid sequence
corresponding to human bilirubin UDP-
glucuronosyltransferase type II.

13) The human liver cDNA segment HUG-Br1,
ATCC accession number 68510.

14) The human liver cDNA segment HUG-Br2,
ATCC accession number 68509.

15) A recombinant DNA molecule comprising
a vector and said DNA segment of claim 11.

16) A recombinant DNA molecule comprising
a vector and said DNA segment of claim 12.

17) A recombinant DNA molecule comprising
a vector and said human liver cDNA segment HUG-Br1
of claim 13.

18) A recombinant DNA molecule comprising
a vector and said human liver cDNA segment HUG-Br2
of claim 14.

19) A prokaryotic cell transformed with
said recombinant DNA molecule of claim 10.

20) A prokaryotic cell transformed with
said recombinant DNA molecule of claim 15.

21) A prokaryotic cell transformed with
said recombinant DNA molecule of claim 16.

22) A prokaryotic cell transformed with
said recombinant DNA molecule of claim 17.

23) A prokaryotic cell transformed with
said recombinant DNA molecule of claim 18.

24) A eukaryotic cell transfected with
said recombinant DNA molecule of claim 10.

25) A eukaryotic cell transfected with
said recombinant DNA molecule of claim 15.

26) A eukaryotic cell transfected with
said recombinant DNA molecule of claim 16.

27) A eukaryotic cell transfected with
said recombinant DNA molecule of claim 17.

28) A eukaryotic cell transfected with
said recombinant DNA molecule of claim 18.

29) A diagnostic probe of at least 18
bases for the diagnosis of a syndrome selected from
the group consisting of Gilbert's syndrome, Crigler-
Najjar syndrome Type I, and Crigler-Najjar syndrome
Type II comprising a specific sequence of a cDNA
encoding a mammalian bilirubin UDP-glucurono-
syltransferase enzyme, mutant form, or variant
thereof, said diagnostic probe identifying the
corresponding mammalian genomic DNA.

30) A diagnostic probe of at least 18
bases for the diagnosis of a syndrome selected from
the group consisting of Gilbert's syndrome, Crigler-
Najjar syndrome Type I and Crigler-Najjar syndrome
Type II comprising a DNA sequence having sufficient
homology to a cDNA encoding a mammalian bilirubin
UDP-glucuronosyltransferase form, mutant form, or
variant thereof to identify the mammalian genomic
DNA corresponding to said cDNA.

31) A diagnostic probe for the detection
of Crigler-Najjar, Type I syndrome, wherein said
probe comprises a DNA sequence corresponding to the
13-bp nucleotide deletion present in exon 2 of said
DNA segment of claim 9.

32) A PCR primer pair designed to amplify
a specific portion of mammalian genomic DNA, wherein
each member of said primer pair is of from 17 to 20
bases, and further wherein each member of said
primer pair comprises a specific sequence of a cDNA
encoding a mammalian bilirubin UDP-glucurono-
syltransferase form, mutant form, or variant
thereof.

33) A diagnostic assay for a syndrome
selected from the group consisting of Gilbert's
disease, Crigler-Najjar syndrome Type I and Crigler-
Najjar syndrome Type II comprising the steps of:

(a) amplifying a sequence of mammalian
genomic DNA using a PCR primer pair according to
claim 32 to produce multiple DNA copies of said
mammalian genomic DNA, and

(b) exposing said amplified DNA
according to (a) to the diagnostic probe of claims 29
or 30 under hybridization conditions to identify a
specific mammalian genomic sequence characteristic
of Gilbert's disease, Crigler-Najjar syndrome Type I
or Crigler-Najjar syndrome Type II.

34) A diagnostic assay for the detection
of Crigler-Najjar, Type I syndrome in a patient
comprising the steps of:

a) altering one of a pair of normal PCR
primers such that said altered primer contains the
deletion mutation present in exon 2 of UGT1F of the
genome of Crigler-Najjar, Type I patients;

b) adding said primers to a DNA sample
of a patient suspected of having said syndrome; and

c) determining whether hybridization
occurs such that a PCR product results, lack of said
product indicating presence of Crigler-Najjar, Type
I syndrome.

35) An amino acid sequence as shown in
Figure 9 wherein said sequence corresponds to human
bilirubin UDP-glucuronosyltransferase, Type I (HUG-
Br1).

36) An amino acid sequence as shown in
Figure 9 wherein said sequence corresponds to human
bilirubin UDP-glucuronosyltransferase, Type II (HUG-
Br2).